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NPCILI (NPC3^ AND METHODS OF USE THEREOF 
This application claims the benefit of U.S. Provisional Patent Application No. 60/397,442; 
filed July 19, 2002 which is herein incorporated by reference in its entirety. 

FIELD OF THE T1WENTION 

The present invention includes NPCILI polypeptides and polynucleotides which encode 
the polypeptides along with methods of use thereof. 

BACKGROUND OF THE EWENTION 

A fector leading to development of vascular disease, a leading cause of death in 
industrialized nations, is elevated serum cholesterol. It is estimated that 19% of Americans 
between the ages of 20 and 74 years of age have high serum cholesterol. The most prevalent form 
of vascular disease is arteriosclerosis, a condition associated with the thickening and hardening of 
the arterial wall. Arteriosclerosis of the large vessels is referred to as atherosclerosis. 
Atherosclerosis is the predominant underlying factor in vascular disorders such as coronary artery 
disease, aortic aneurysm, arterial disease of the lower extremities and cerebrovascular disease. 

Cholesteryl esters are a major component of atherosclerotic lesions and the major storage 
form of cholesterol in arterial wall cells. Formation of cholesteryl esters is also a step in the 
intestinal absorption of dietary cholesterol. Thus, inhibition of cholesteryl ester formation and 
reduction of serum cholesterol can inhibit the progression of atherosclerotic lesion formation, 
decrease the accumulation of cholesteryl esters in the arterial wall, and block the intestinal 
absorption of dietary cholesterol. 

The regulation of whole-body cholesterol homeostasis in mammals and animals involves 
the regulation of intestinal cholesterol absorption, cellular cholesterol traflacking, dietary 
cholesterol and modulation of cholesterol biosynthesis, bile acid biosynthesis, steroid biosynthesis 
and the catabolism of the cholesterol-containing plasma lipoproteins. Regulation of intestinal 
cholesterol absorption has proven to be an effective means by which to regulate serum cholesterol 
levels. For example, a cholesterol absorption inhibitor, ezetimibe ( 
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), has been shown to be effective in this regard. Identification of a gene target through which 
ezetimibe acts is important to understanding the process of cholesterol absorption and to the 
development of other, novel absorption inhibitors. The present invention addresses this need by 

5 providing a rat and a mouse homologue of human NPCILI (also known as NPC3; Genbank 

Accession No. AF192522; Davies, et al, (2000) Genomics 65(2): 137-45 and loannou, (2000) Mol. 
Genet. Metab.7 1(1-2): 175-81), the ezetimibe target. 

NPCILI is an N-glycosylated protein comprising a YQRL (SEQ ID NO: 38) motif (Le., a 
fran^-golgi network to plasma membrane transport signal; see Bos, et aL, (1993) EMBO J. 

10 12:2219-2228; Humphrey, et aL, (1993) J. Cell. Biol. 120: 1 123-1 135; Ponnambalam, et al, (1994) 
J. Cell. Biol. 125:253-268 andRothman, et al, (1996) Science 272:227-234) which exhibits 
limited tissue distribution and gastrointestinal abundance. Also, the human NPCILI promoter 
includes a Sterol Regulated Element Binding Protein 1 (SREBPl) binding consensus sequence 
(Athanikar, et al, (1998) Proc. Nati. Acad. Sci. USA 95:4935-4940; Ericsson, et aL, (1996) Proc. 

15 Nati. Acad. ScL USA 93:945-950; Metherall, et a/., (1989) J. Biol. Chem. 264:15634-15641; 
Smith, et a/., (1990) J. Biol. Chem. 265:2306-2310; Bennett, et aU (1999) J. Biol. Chem. 
274:13025-13032 and Brown, etaL, (1997) Cell 89:331-340). NPCILI has42% amino acid 
sequence homology to human NPCl (Genbank Accession No. AF002020), a receptor responsible 
for Niemann-Pick CI disease (Carstea, et al, (1997) Science 277:228-231). Niemann-Pick CI 

20 disease is a rare genetic disorder in humans which results in accumulation of low density 

lipoprotein (LDL)-derived unesterified cholesterol in lysosomes (Pentchev, et a/., (1994) Biochim. 
Biophys. Acta. 1225: 235-243 and Vanier, et al., (1991) Biochim. Biophys. Acta. 1096:328-337). 
In addition, cholesterol accumulates in the ^ran^r-golgi network of npcr cells, and relocation of 
cholesterol, to and from the plasma membrane, is delayed. NPCl and NPCILI each possess 13 

25 transmembrane spanning segments as well as a sterol-sensing domain (SSD). Several other 
proteins, including HMG-CoA Reductase (HMG-R), Patched (PTC) and Sterol Regulatory 
Element Binding Protein Cleavage- Activation Protein (SCAP), include an SSD which is involved 
in sensing cholesterol levels possibly by a mechanism which involves direct cholesterol binding 



wo 2004/009772 



PCT/US2003/022467 



-3- 

(Gil, et aU (1985) Cell 41:249-258; Kumagai, et aU (1995) J. Biol. Chem. 270:19107-19113 and 
Hua, et cd,, (1996) Cell 87:415-426). 

SUMMARY OF THE INVENTION 

The present invention includes an isolated polypeptide comprising 42 or more contiguous 
amino acids from an amino acid sequence selected from SEQ ID NOs: 2 and 12, preferably 
comprising the amino acid sequence selected from SEQ ID NOs: 2 and 12. The invention also 
includes an isolated polynucleotide encoding a polypeptide of SEQ ID NO: 2 or 12, preferably 
comprising a nucleotide sequence selected from SEQ ID NOs: 1, 5-10, 1 1 and 13. A recombinant 
vector comprising a polynucleotide of the invention is also provided along with a host cell 
comprisiag the vector. 

The present invention also provides an antibody which specifically binds to NPCILI (e.g., 
mouse NPCILI or human NPCILI) or any antigenic fragment thereof, preferably rat NPCILI, 
more preferably a polypeptide comprising an amino acid sequence selected from SEQ ID NO: 39- 
42. Preferably, the antibody is a polyclonal or monoclonal antibody. Preferably, the antibody is 

obtained from a rabbit. 

The present invention also includes a method for making an NPCILI polypeptide of the 
invention comprising culturing a host cell of the invention under conditions in which the nucleic 
acid in tiie cell which encodes the NPCILI polypeptide is expressed. Preferably, the method 
includes the step of isolating the polypeptide from the culture. 

The present invention includes methods for identifying an agonist or antagonist of NPCILI 
comprising (a) contacting a host cell (e.g., Chinese hamster ovary (CHO) cell, a J774 cell, a 
macrophage cell and a Caco2 cell) expressing a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 2 or SEQ ID NO: 4 or SEQ ID NO: 12 or a functional fragment tiiereof on a cell 
surface, in the presence of a known amount of detectably labeled (e.g., with or ^^^I) ezetimibe, 
with a sample to be tested for the presence of an NPCILI agonist or antagonist; and (b) measuring 
the amount of detectably labeled ezetimibe specifically bound to the polypeptide; wherein an 
NPCILI agonist or antagonist in the sample is identified by measuring substantially reduced 
binding of the detectably labeled ezetimibe to the polypeptide, compared to what would be 
measured in the absence of such an agonist or antagonist. 

Another method for identifying an agonist or antagonist of NPCILI is also provided. The 
method comprises (a) placing, in an aqueous suspension, a plurality of support particles, 
impregnated with a fluorescer (e.g., yttrium silicate, yttrium oxide, diphenyloxazole and 
polyvinyltoluene), to which a host cell (e.g., Chinese hamster ovary (CHO) cell, a J774 cell, a 
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macrophage cell and a Caco2 cell) expressing a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 2 or SEQ ID NO: 4 or SEQ ID NO: 12 or a functional fragment thereof on a cell 
surface are attached; (b) adding, to the suspension, radiolabeled (e.g., with or ^^I) ezetimibe and 
a sample to be tested for the presence of an antagonist or agonist, wherein the radiolabel emits 
radiation energy capable of activating the fluorescer upon the binding of the ezetimibe to the 
polypeptide to produce light energy, whereas radiolabeled ezetimibe that does not bind to the 
polypeptide is, generally, too far removed from the support particles to enable the radioactive 
energy to activate the fluorescer; and (c) measuring the light energy emitted by the fluorescer in the 
suspension; wherein an NPCILI agonist or antagonist in the sample is identified by measuring 
substantially reduced light energy emission, compared to what would be measured in the absence 
of such an agonist or antagonist. 

Also provided is a method for identifying an agonist or antagonist of NPCILI comprising 
(a) contacting a host ceU (e.g., Chinese hamster ovary (CHO) cell, a J774 cell, a macrophage cell 
and a Caco2 cell) expressing an polypeptide comprising an amino acid sequence of SEQ ID NO: 2 
or SEQ ID NO: 4 or SEQ ID NO: 12 or a functional fragment thereof on a cell surface with 
detectably labeled (^.g., with and ^^^I) cholesterol and with a sample to be tested for the 
presence of an antagonist or agonist; and (b) measuring the amount of detectably labeled 
cholesterol in the cell; wherein an NPCILI antagonist in the sample is identified by measuring 
substantially reduced detectably labeled cholesterol within the host cell, compared to what would 
be measured in the absence of such an antagonist and wherein an NPCILI agonist in the sample is 
identified by measuring substantially increased detectably labeled cholesterol within the host cell, 
compared to what would be measured in the absence of such an agonist. 

Also included in the present invention is a mutant mouse comprising a homozygous or 
heterozygous disruption of endogenous, chromosomal NPCILI wherein, preferably, the mouse 
does not produce any functional NPCILI protein. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention includes an NPCILI polypeptide from rat and from mouse along 
with polynucleotides encoding the respective polypeptides. Preferably, the rat NPCILI 
polypeptide comprises the amino acid sequence set forth in SEQ ID NO: 2 and the mouse NPCILI 
polypeptide comprises the amino acid sequence set forth in SEQ ID NO. 12. The lai NPCILI 
polynucleotide of SEQ ID NO:l or 10 encodes the rat NPCILI polypeptide. The mouse NPCILI 
polynucleotide of SEQ ID NO: 11 or 13 encodes the mouse NPCILI polypeptide. 
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The present invention includes any polynucleotide or polypeptide comprising a nucleotide 
or amino acid sequence referred to, below, in Table L 



Table 1- Polynucleotides and Polypeptides of the Invention, 



Polvnucleotide or Polypeptide 


Sequence Identifier 


Rat NPCILI polynucleotide 


SEQ ID NO: 1 


Rat NPCILI Dolvoeotide 


SEQ ID NO: 2 


Human NPCILI polynucleotide 


SEQ ID NO: 3 


Human NPCILI DolvDCDtide 


SEQ ID NO: 4 


Rat NPCILI expressed sequence tag 
603662080F1 fDartial sequence) 


SEQ ID NO: 5 


Rat NPCILI expressed sequence tag 
603665037F1 (partial sequence) 


SEQ ID NO: 6 


Rat NPCILI expressed sequence tag 
604034587F1 (partial sequence) 


SEQ ED NO: 7 


EST 603662080F1 with downstream 
sequences added 


SEQ ID NO: 8 


EST 603662080F1 with upstream and 
downstream sequences added 


SEQ ID NO: 9 


Back-translated polynucleotide sequence of 
rat NPCILI 


SEQ ID NO: 10 


Mouse NPCILI polynucleotide 


SEQ ID NO: 11 


Mouse NPCILI polypeptide 


SEQ ID NO: 12 


Back-translated polynucleotide sequence of 
mouse NPCILI 


SEQ ID NO: 13 



5 



A human NPCILI is also disclosed under Genbank Accession Number AF192522. As 
discussed below, the nucleotide sequence of the rat NPCILI set forth in SEQ ID NO: 1 was 
obtmned from an expressed sequence tag (EST) from a rat jejunum enterocyte cDNA library. SEQ 
ID NOs: 5-7 include partial nucleotide sequences of three independent cDNA clones. The 
10 downstream sequence of the SEQ ID NO: 5 EST (603662080F1) were determined; the sequencing 
data from these experiments are set forth in SEQ ID NO: 8. The upstream sequences were also 
determined; these data are set forth in SEQ ID NO: 9. 
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SEQ ID NOs: 43 and 44 are the nucleotide and amino acid sequence, respectively, of 
human NPCILI which is disclosed under Genbank Accession No,: AF192522 (see Davies, et a/., 

(2000) Genomics 65(2): 137-45). 

SEQ ID NO: 45 is tiie nucleotide sequence of a mouse NPCILI which is disclosed under 

Genbank Accession No. AK078947. 

Molecular Biology 

In accordance with ttie present invention there may be employed conventional molecular 
biology, microbiology, and recombinant DNA techniques within the skill of the art. Such 
techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, Molecular 
Cloning: A T .aboratorv Manual Second Edition (1989) Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York (herein "Sambrook, et al, 1989"); DNA Cloning: A Practical 
Approach. Volumes I and II (D.N. Glover ed. 1985); Oligonucleo tide Synthesis (MJ. Gait ed. 
1984); Nucleic Acid Hvbridization (B.D. Hames & S.J. Higgins eds. (1985)); Transcription And 
Translation (B.D. Hames & S.J. Higgins, eds. (1984)); Animal CeU Culture (R.L Freshney, ed. 
(1986)); Tmmobilize d Cells And Enzymes (IRL Press, (1986)); B. Perbal, A Practical Guide To 
Molecular Cloning (1984); RM. Ausubel, et al. (eds.), Current Protocols in Molecular Biology, 

John Wiley & Sons, Inc. (1994). 

The back-translated sequences of SEQ ID NO: 10 and of SEQ ID NO: 13 uses the single- 
letter code shown in Table 1 of Annex C, Appendix 2 of the PCT Administrative Instruction in tiie 
Manual of Patent Examination Procedure. 

A "polynucleotide", "nucleic acid " or "nucleic acid molecule" may refer to the phosphate 
ester polymeric form of ribonucleo sides (adenosine, guanosine, uridine or cytidine; "RNA 
molecules") or deoxyribonucleosides (deoxyadenosine, deoxyguanosine, deoxythymidine, or 
deoxycytidine; "DNA molecules"), or any phosphoester analogs thereof, such as phosphorofhioates 
and thioesters, in single stranded form, double-stranded form or otherwise. 

A "polynucleotide sequence", "nucleic acid sequence" or "nucleotide sequence" is a series 
of nucleotide bases (also called "nucleotides") in a nucleic acid, such as DNA or RNA, and means 
any chain of two or more nucleotides. 

A "coding sequence" or a sequence "encoding" an expression product, such as a RNA, 
polypeptide, protein, or enzyme, is a nucleotide sequence that, when expressed, results in 

production of the product. 

The term "gene" means a DNA sequence that codes for or corresponds to a particular 
sequence of ribonucleotides or amino acids which comprise all or part of one or more RNA 
molecules, proteins or enzymes, and may or may not include regulatory DNA sequences, such as 
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promoter sequences, which determine, for example, the conditions under which the gene is 
expressed. Genes may be transcribed from DNA to RNA which may or may not be translated into 

an amino acid sequence. 

The present invention includes nucleic acid fragments of any of SEQ ID NOs: 1, 5-11 or 
13. A nucleic acid "fragment" includes at least about 30 (e.g,, 31, 32, 33, 34), preferably at least 
about 35 ie,g, 25, 26, 27, 28, 29, 30, 31, 32, 33 or 34), more preferably at least about 45 (e,g„ 35, 
36, 37, 38, 39, 40, 41, 42, 43 or 44), and most preferably at least about 126 or more contiguous 
nucleotides {e.g., 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 150, 160, 170, 180, 190, 
200, 300, 400, 500, 1000 or 1200) from any of SEQ ID NOs: 1, 5-11 or 13. 

The present invention also includes nucleic acid fragments consisting of at least about 7 
(^.g., 9, 12, 17, 19), preferably at least about 20 (e.g., 30, 40, 50, 60), more preferably about 70 
(e.g., 80, 90, 95), yet more preferably at least about 100 (e.g., 105, 110, 1 14) and even more 
preferably at least about 115 (e.g., 117, 119, 120, 122, 124, 125, 126) contiguous nucleotides from 
any of SEQ ID NOs: 1, 5-11 or 13. 

As used herein, the term "oligonucleotide" refers to a nucleic acid, generally of no more 
than about 100 nucleotides (e.g., 30, 40, 50, 60, 70, 80, or 90), that may be hybridizable to a 
genomic DNA molecule, a cDNA molecule, or an mRNA molecule encoding a gene, mRNA, 
cDNA, or other nucleic acid of interest. Oligonucleotides can be labeled, e.g., by incorporation of 
^^P-nucleotides, ^H-nucleotides, ^"^C-nucleotides, ^^S -nucleotides or nucleotides to which a label, 
such as biotin, has been covalently conjugated. In one embodiment, a labeled oligonucleotide can 
be used as a probe to detect the presence of a nucleic acid. In another embodiment, 
oligonucleotides (one or both of which may be labeled) can be used as PGR primers, either for 
cloning full length or a fragment of the gene, or to detect the presence of nucleic acids. Generally, 
oligonucleotides are prepared synthetically, preferably on a nucleic acid synthesizer. 

A "protein sequence", "peptide sequence" or "polypeptide sequence" or "amino acid 
sequence" may refer to a series of two or more amino acids in a protein, peptide or polypeptide. 

"Protein", "peptide" or "polypeptide" includes a contiguous string of two or more amino 
acids. Preferred peptides of the invention include those set forth in any of SEQ ID NOs: 2 or 12 as 
well as variants and fragments thereof. Such fragments preferably comprise at least about 10 
(^.g.,11, 12, 13, 14, 15, 16, 17, 18 or 19), more preferably at least about 20 (e.g., 21, 22, 23, 24, 25, 
26, 27, 28, 29, 30, 35, 40), and yet more preferably at least about 42 (e.g., 43, 44, 45, 46, 47, 48, 
49, 50, 60, 70, 80, 90, 100, 110, 120 or 130) or more contiguous amino acid residues from any of 
SEQ ID NOs: 2 or 12. 
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The present invention also includes polypeptides, preferably antigenic polypeptides, 
consisting of at least about 7 (e.g., 9, 10, 13, 15, 17, 19), preferably at least about 20 {e.g., 22, 24, 
26, 28), yet more preferably at least about 30 (e.g., 32, 34, 36, 38) and even more preferably at 
least about 40 (e.g., 41, 42) contiguous amino acids from any of SEQ ID NOs: 2 or 12. 

The polypeptides of the invention can be produced by proteolytic cleavage of an intact 
peptide, by chemical synthesis or by the application of recombinant DNA technology and are not 
limited to polypeptides delineated by proteolytic cleavage sites. The polypeptides, either alone or 
cross-linked or conjugated to a carrier molecule to render them more immunogenic, are useful as 
antigens to elicit the production of antibodies and fragments thereof. The antibodies can be used, 
e.g., in itnmunoassays for inmiunoaffinity purification or for inhibition of NPCILI, etc. 

The terms "isolated polynucleotide" or "isolated polypeptide" include a polynucleotide 
(e.g., RNA or DNA molecule, or a mixed polymer) or a polypeptide, respectively, which are 
partially or fully separated from other components that are normally found in cells or in 
recombinant DNA expression systems. These components include, but are not limited to, cell 
membranes, cell walls, ribosomes, polymerases, serum components and extraneous genomic 
sequences. 

An isolated polynucleotide or polypeptide will, preferably, be an essentially homogeneous 
composition of molecules but may contain some heterogeneity. 

" AmpUfication" of DNA as used herein may denote the use of polymerase chain reaction 
(PGR) to increase the concentration of a particular DNA sequence within a mixture of DNA 
sequences. For a description of PGR see Saiki, et aL, Science (1988) 239:487. 

The term "host cell" includes any cell of any organism that is selected, modified, 
transfected, transformed, grown, or used or manipulated in any way, for the production of a 
substance by the cell, for example the expression or replication, by the cell, of a gene, a DNA or 
RNA sequence or a protein. Preferred host cells include Chinese hamster ovary (GHO) cells, 
murine macrophage J774 cells or any other macrophage cell Ime and human intestinal epithelial 
Gaco2 cells. 

The nucleotide sequence of a nucleic acid may be determined by any method known in the 
art (e.g., chemical sequencing or enzymatic sequencing). "Ghemical sequencing" of DNA includes 
methods such as that of Maxam and Gilbert (1977) (Proc. Nad. Acad. Sci. USA 74:560), in which 
DNA is randomly cleaved using individual base-specific reactions. "Enzymatic sequencing" of 
DNA includes methods such as that of Sanger (Sanger, et aL, (1977) Proc. Nad. Acad. Sci. USA 
74:5463). 
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The nucleic acids herein may be flanked by natural regulatory (expression control) 
sequences, or may be associated with heterologous sequences, including promoters, internal 
ribosome entry sites (IRES) and other ribosome binding site sequences, enhancers, response 
elements, suppressors, signal sequences, polyadenylation sequences, introns, 5'- and 3'- non-coding 
regions, and the like. 

In general, a "promoter" or "promoter sequence" is a DNA regulatory region capable of 
binding an RNA polymerase in a cell {e.g., direcdy or through other promoter-bound proteins or 
substances) and initiating transcription of a coding sequence. A promoter sequence is, in general, 
bounded at its 3' terminus by the transcription initiation site and extends upstream (5' direction) to 
include the mitiimum number of bases or elements necessary to itiitiate transcription at any level. 
Within the promoter sequence may be found a transcription initiation site (conveniently defined, 
for example, by mapping with nuclease SI), as well as protein binding domains (consensus 
sequences) responsible for the binding of RNA polymerase. The promoter may be operably 
associated widi other expression control sequences, including enhancer and repressor sequences or 
with a nucleic acid of the invention. Promoters which may be used to control gene expression 
include, but are not limited to, cytomegalovirus (CMV) promoter (U.S. Patent Nos. 5,385,839 and 
5,168,062), the SV40 early promoter region (Benoist, et al, (1981) Nature 290:304-310), the 
promoter contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto, et aL, (1980) 
Cell 22:787-797), the herpes thymidine kinase promoter (Wagner, et aL, (1981) Proc. Nad. Acad. 
Sci. USA 78:1441-1445), the regulatory sequences of the metallothionein gene (Brinster, et aL, 
(1982) Nature 296:39-42); prokaryotic expression vectors such as the ^-lactamase promoter 
(Villa-Komaroff, et al., (1978) Proc. Natl. Acad. Sci. USA 75:3727-3731), or the tac promoter 
(DeBoer, et aL, (1983) Proc. Natl. Acad. ScL USA 80:21-25); see also "Useful proteins from 
recombinant bacteria" in Scientific American (1980) 242:74-94; and promoter elements from yeast 
or other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK 
(phosphoglycerol kinase) promoter or the alkaline phosphatase promoter. 

A coding sequence is "under the control of, "functionally associated with" or "operably 
associated with" transcriptional and translational control sequences in a cell when the sequences 
direct RNA polymerase mediated transcription of the coding sequence into RNA, preferably 
mJRNA, which then may be RNA spliced (if it contains introns) and, optionally, translated into a 
protein encoded by the coding sequence. 

The terms "express" and "expression" mean allowing or causing the information in a gene, 
RNA or DNA sequence to become manifest; for example, producing a protein by activating the 
cellular functions involved in transcription and translation of a corresponding gene. A DNA 
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sequence is expressed in or by a cell to form an "expression product" such as an RNA (e.g., 
mRNA) or a protein. The expression product itself may also be said to be "expressed" by the cell. 

The term "transformation" means the introduction of a nucleic acid into a cell. The 
introduced gene or sequence may be called a "clone". A host cell that receives the introduced 
DNA or RNA has been "transformed" and is a "transformant" or a "clone." The DNA or RNA 
introduced to a host cell can come from any source, including cells of the same genus or species as 
the host cell, or from cells of a different genus or species. 

The term "vector" includes a vehicle (e.g., a plasmid) by which a DNA or RNA sequence 
can be introduced into a host cell, so as to transform the host and, optionally, promote expression 
and/or repUcation of the introduced sequence. 

Vectors that can be used in this invention include plasmids, viruses, bacteriophage, 
integratable DNA fragments, and other vehicles that miay faciUtate introduction of the nucleic acids 
into the genome of the host. Plasmids are the most conmionly used form of vector but aU other 
forms of vectors which serve a similar function and which are, or become, known in the art are 
suitable for use herein. See, e.g., Pouwels, et al. Cloning Vectors: A Laboratory Manual. 1985 
and Supplements, Elsevier, N.Y., and Rodriguez et aL (eds.). Vectors: A Surv^ev of Molecular 
Cloning Vectors and Their Uses. 1988, Buttersworth, Boston, MA. 

-The term "expression system" means a host cell and compatible vector which, under 
suitable conditions, can express a protein or nucleic acid which is carried by the vector and 
introduced to the host cell. Conmion expression systems include E, coli host cells and plasmid 
vectors, insect host cells and Baculovirus vectors, and mammalian host cells and vectors. 

Expression of nucleic acids encoding the NPCILI polypeptides of this invention can be 
carried out by conventional methods in either prokaryotic or eukaryotic cells. Although E. coli 
host cells are employed most frequently in prokaryotic systemis, many other bacteria, such as 
various strains of Pseudomonas and Bacillus, are known in the art and can be used as well. 
Suitable host cells for expressing nucleic acids encoding the NPCILI polypeptides include 
prokaryotes and higher eukaryotes. Prokaryotes include both gram-negative and gram-positive 
organisms, e.g„ E. coli and jB. subtilis. Higher eukaryotes include established tissue culture cell 
lines from animal cells, both of non-mammalian origin, e.g,, insect cells, and birds, and of 
manmiaUan origin, e.g., human, primates, and rodents. 

Prokaryotic host- vector systems include a wide variety of vectors for many different 
species. A representative vector for ampUfying DNA is pBR322 or many of its derivatives ( e.g., 
pUC18 or 19). Vectors that can be used to express the NPCILI polypeptides include, but are not 
limited to, those containing the lac promoter (pUC-series); trp promoter (pBR322-trp); Ipp 
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promoter (the pIN-series); lambda-pP or pR promoters (pOTS); or hybrid promoters such as ptac 
(pDR540). See Brosius et aL, "Expression Vectors Employing Lambda-, trp-, lac-, and Ipp-derived 
Promoters", in Rodriguez and Denhardt (eds.) Vectors: A Survey of Molecular Cloning Vectors 
and Their Uses. 1988, Buttersworth, Boston, pp. 205-236. Many polypeptides can be expressed, at 
high levels, in an E.coli/T7 expression system as disclosed in U.S. Patent Nos. 4,952,496, 
5,693,489 and 5,869,320 and in Davanloo, P., etaL, (1984) Proc. Nad. Acad. Sci. USA 81: 2035- 
2039; Studier, F. W., etaL, (1986) J. MoL Biol. 189: 113-130; Rosenberg, A. H., etal, (1987) 
Gene 56: 125-135; and Dunn, J. J., et aL, (1988) Gene 68: 259, 

Higher eukaryotic tissue culture cells may also be used for the recombinant production of 
the NPCILI polypeptides of the invention. Although any higher eukaryotic tissue culture cell Une 
might be used, including insect baculovirus expression systems, manunalian cells ate preferred. 
Transformation or transfection and propagation of such cells have become a routine procedure. 
Examples of useful cell lines include HeLa cells, Chinese hamster ovary (CHO) cell lines, J774 
cells, Caco2 cells, baby rat kidney (BRK) cell lines, insect cell lines, bird cell lines, and monkey 
(COS) cell lines. Expression vectors for such cell lines usually include an origin of replication, a 
promoter, a translation initiation site, RNA splice sites (if genomic DNA is used), a 
polyadenylation site, and a transcription termination site. These vectors also, usually, contain a 
selection gene or ampUfication gene. Suitable expression vectors may be plasmids, viruses, or 
retroviruses carrying promoters derived, e.g., from such sources as adenovirus, SV40, 
parvoviruses, vaccinia virus, or cytomegalovirus. Examples of expression vectors include 

pCR®3.1, pCDNAl, pCD (Okayama, et aL, (1985) Mol. Cell BioL 5:1 136), pMClneo Poly-A 
(Thomas, et aL, (1987) Cell 5i:503), pREP8, pSVSPORT and derivatives thereof, and baculovirus 
vectors such as pAC373 or pAC610. One embodiment of the invention includes membrane bound 
NPCILL In this embodiment, NPCILI can be expressed in the cell membrane of a eukaryotic cell 
and the membrane bound protein can be isolated from the cell by conventional methods which are 
known in the art. 

The present invention also includes ftisions which include the NPCILI polypeptides and 
NPCILI polynucleotides of the present invention and a second polypeptide or polynucleotide 
moiety, which may be referred to as a "tag". The fusions of the present invention may comprise 
any of the polynucleotides or polypeptides set forth in Table 1 or any subsequence or fragment 
thereof (discussed above). The fused polypeptides of the invention may be conveniently 
constructed, for example, by insertion of a polynucleotide of the invention or fragment thereof into 
an expression vector. The fusions of the invention may include tags which facilitate purification or 
detection. Such tags include glutathione-S-transferase (GST), hexahistidine (His6) tags, maltose 



wo 2004/009772 



PCT/US2003/022467 



- 12- 

binding protein (MBP) tags, haemagglutinin (HA) tags, cellulose binding protein (CBP) tags and 

•^11 1- 32t» 35o 3tt 99mrn« 123t UIt*, 68r^^ IStj 125t 131t 113mT„ 76t>^ 

myctags. Detectable tags such as ^^P, ''^S, ^H, ^^™Tc, I, In, Gra, F, I, 1, In, Br, 
^^Ga, ^^™Tc, ^^I, ^^^In and ^^Ga may also be used to label the polypeptides and polynucleotides of 
the invention. Methods for constructing and using such fusions are very conventional and well 
known in the art. 

Modifications (e.g., post-translational modifications) that occur in a polypeptide often will 
be a function of how it is made. For polypeptides made by expressing a cloned gene in a host, for 
instance, the nature and extent of the modifications, in large part, will be determined by the host 
cell's post-translational modification capacity and die modification signals present in the 
polypeptide amino acid sequence. For instance, as is well known, glycosylation often does not 
occur in bacterial hosts such as E. colL Accordingly, when glycosylation is desired, a polypeptide 
can be expressed in a glycosylating host, generally a eukaryotic cell. Insect cells often carry out 
post-translational glycosylations which are similar to those of manmialian cells. For this reason, 
insect cell expression systems have been developed to express, efficiently, mammalian proteins 
having native patterns of glycosylation. An insect cell which may be used in this invention is any 
cell derived from an organism of the class Insecta. Preferably, the insect is Spodopterafruigiperda 
(Sf9 or Sf21) or Trichoplusia ni (High 5). Examples of insect expression systems that can be used 
with the present invention, for example to produce NPCILI polypeptide, include Bac-To-Bac 
(Invitrogen Corporation, Carlsbad, CA) or Gateway (Invitrogen Corporation, Carlsbad, CA). If 
desired, deglycosylation enzymes can be used to remove carbohydrates attached during production 
in eukaryotic expression systems. 

Other modifications may also include addition of aliphatic esters or amides to the 
polypeptide carboxyl terminus. The present invention also includes analogs of the NPCILI 
polypeptides which contain modifications, such as incorporation of unnatural amino acid residues, 
or phosphorylated amino acid residues such as phosphotyrosine, phosphoserine or 
phosphothreonine residues. Other potential modifications include sulfonation, biotinylation, or the 
addition of other moieties. For example, the NPCILI polypeptides of the invention may be 
appended with a polymer which increases the half-life of the peptide in the body of a subject. 
Preferred polymers include polyethylene glycol (PEG) (e.g., PEG with a molecular weight of 2 
kDa, 5 kDa, 10 kDa, 12 kDa, 20 kDa, 30 kDa and 40 kDa), dextran and monomethoxypolyethylene 
glycol (mPEG). 

The peptides of the invention may also be cycUzed. Specifically, the amino- and carboxy- 
terminal residues of an NPCILI polypeptide or two internal residues of an NPCILI polypeptide of 
the invention can be fused to create a cyclized peptide. Methods for cycUzing peptides are 
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convetitional and very well known in the art; for example see Gurrath, et aL, (1992) Eur. J . 

Biochem. 210:911-921. 

The present invention contemplates any superficial or slight modification to the amino acid 
or nucleotide sequences which correspond to the polypeptides of the invention. In particular, the 
present invention contemplates sequence conservative variants of the nucleic acids which encode 
the polypeptides of the invention. "Sequence-conservative variants" of a polynucleotide sequence 
are those in which a change of one or more nucleotides in a given codon results in no alteration in 
the amino acid encoded at that position. Function-conservative variants of the polypeptides of the 
invention are also contemplated by the present invention. "Function-conservative variants" are 
those in which one or more amino acid residues in a protein or enzyme have been changed without 
altering the overall conformation and function of the polypeptide, including, but, by no means, 
limited to, replacement of an amino acid with one having similar properties. Amino acids with 
similar properties are well known in the art. For example, polar/hydrophilic amino acids which 
may be interchangeable include asparagine, glutandne, serine, cysteine, threonine, lysine, arginine, 
histidine, aspartic acid and glutamic acid; nonpolar/hydrophobic amino acids which may be 
interchangeable include glycine, alanine, valine, leucine, isoleucine, proline, tyrosine, 
phenylalanine, tryptophan and methionine; acidic amino acids which may be interchangeable 
include aspartic acid and glutamic acid and basic amino acids which may be interchangeable 
include histidine, lysine and arginine. 

The present invention includes polynucleotides encoding rat or mouse NPCILI and 
ft-agments thereof as well as nucleic acids which hybridize to the polynucleotides. Preferably, the 
nucleic acids hybridize under low stringency conditions, more preferably under moderate 
stringency conditions and most preferably under high stringency conditions, A nucleic acid 
molecule is "hybridizable" to another nucleic acid molecule, such as a cDNA, genomic DNA, or 
RNA, when a single stranded form of the nucleic acid molecule can anneal to the other nucleic acid 
molecule under the appropriate conditions of temperature and solution ionic strength (see 
Sambrook, et ah, supra). The conditions of temperature and ionic strength determine the 
"stringency" of the hybridization. Typical low stringency hybridization concUtions are 55°C, 5X 
SSC, 0.1% SDS, 0.25% milk, and no formamide at 42°C; or 30% formamide, 5X SSC, 0.5% SDS 
at 42°C. Typical, moderate stringency hybridization conditions are similar to the low stringency 
conditions except the hybridization is carried out in 40% formamide, with 5X or 6X SSC at 42°C. 
High stringency hybridization conditions are similar to low stringency conditions except the 
hybridization conditions are carried out in 50% formamide, 5X or 6X SSC and, optionally, at a 
higher temperature (e.g., higher than 42°C: 57 °C, 59 °C, 60 °C, 62 X, 63 °C, 65°C or 68 °C). In 
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general, SSC is 0.15M NaCl and 0.015M Na-citrate. Hybridization requires that the two nucleic 
acids contain complementary sequences, although, depending on the stringency of the 
hybridization, mismatches between bases are possible. The appropriate stringency for hybridizing 
nucleic acids depends on the length of the nucleic acids and the degree of complementation, 
5 variables well known in the art. The greater the degree of similarity or homology between two 
nucleotide sequences, the higher the stringency under which the nucleic acids may hybridize. For 
hybrids of greater than 100 nucleotides in length, equations for calculating the melting temperature 
have been derived (see Sambrook, et aL, supra, 9.50-9.51). For hybridization with shorter nucleic 
acids, Le., oligonucleotides, the position of mismatches becomes more important, and the length of 
10 the oligonucleotide determines its specificity (see Sambrook, et aL, supra). 

Also included in the present invention are polynucleotides comprising nucleotide 
sequences and polypeptides comprising amino acid sequences which are at least about 70% 
identical, preferably at least about 80% identical, more preferably at least about 90% identical and 
most preferably at least about 95% identical (e.g., 95%, 96%, 97%, 98%, 99%, 100%) to the 
15 reference rat NPCILI nucleotide {e.g,, any of SEQ ID NOs: 1 or 5-10) and amino acid sequences 
(^.g., SEQ ID NO: 2) or the mouse NPCILI nucleotide {e.g., any of SEQ ID NOs: 11 or 13) and 
amino acids sequences (e.g., SEQ ID NO: 12), when the comparison is performed by a BLAST 
algorithm wherein the parameters of tiie algorithm are selected to give the largest match between 
the respective sequences over the entire length of the respective reference sequences. Polypeptides 
20 comprising amino acid sequences which are at least about 70% similar, preferably at least about 
80% similar, more preferably at least about 90% similar and most preferably at least about 95% 
similar (e.g., 95%, 96%, 97%, 98%, 99%, 100%) to the reference rat NPCILI amino acid sequence 
of SEQ ID NO: 2 or the mouse NPCILI amino acid sequence of SEQ ID NO: 12, when the 
comparison is performed with a BLAST algorithm wherein the parameters of the algorithm are 
25 selected to give the largest match between flie respective sequences over the entire length of the 
respective reference sequences, are also included in the present invention. 

Sequence identity refers to exact matches between the nucleotides or amino acids of two 
sequences which are being compared. Sequence similarity refers to both exact matches between 
die amino acids of two polypeptides which are being compared in addition to matches between 
30 nonidentical, biochemically related amino acids. Biochemically related amino acids which share 
similar properties and may be interchangeable are discussed above. 

The following references regarding the BLAST algorithm are herein incorporated by 
reference: BLAST ALGORITHMS: Altschul, S.F., et aZ., (1990) J. Mol. Biol. 215:403-410; Gish, 
W., etal, (1993) Nature Genet. 3:266-272; Madden, T.L., et aZ., (1996) Meth. EnzymoL 266:131- 



wo 2004/009772 PCT/US2003/022467 

-15- 

141; Altschul, S.F., et cd., (1997) Nucleic Acids Res. 25:3389-3402; Zhang, J., et al, (1997) 
Genome Res. 7:649-656; Wootton, J.C., etal, (1993) Comput. Chem. 17:149-163; Hancock, J.M., 
et aL, (1994) Comput. Appl. Biosci. 10:67-70; ALIGNMENT SCORING SYSTEMS: Dayhoff, 
M.O., et al, "A model of evolutionary change in proteins." in Atlas of Protein Sequence and 
Structure , (1978) vol. 5, suppl. 3. M.O. Dayhoff (ed.), pp. 345-352, Natl. Biomed. Res. Found., 
Washington, DC; Schwartz, R.M., et aL, "Matrices for detecting distant relationships." in Atlas of 
Protein Sequence and Structure , (1978) vol. 5, suppl. 3." M.O. Dayhoff (ed.), pp. 353-358, Natl. 
Biomed. Res. Found., Washington, DC; Altschul, S.F., (1991) J. MoL Biol. 219:555-565; States, 
D.J., et aL, (1991) Methods 3:66-70; Henikoff, S., et aL, (1992) Proc. Nad. Acad. Sci, USA 
89:10915-10919; Altschul, S.F., etal, (1993) J. MoL Evol. 36:290-300; ALIGNMENT 
STATISTICS: KarUn, S., et aL, (1990) Proc. Nad. Acad. Sci. USA 87:2264-2268; Karlin, S., et 
al., (1993) Proc. NatL Acad. Sci. USA 90:5873-5877; Dembo, A., et aL, (1994) Ann. Prob. 
22:2022-2039; and Altschul, S.F. "Evaluating the statistical significance of multiple distinct local 
alignments." in Theoretical and Computational Methods in Genome Research (S. Suhai, ed.), 
(1997) pp. 1-14, Plenum, New York. 

Protein Purification 

The proteins, polypeptides and antigenic fragments of this invention can be purified by 
standard methods, including, but not limited to, salt or alcohol precipitation, affinity 
chromatography (e.g., used in conjunction with a purification tagged NPCILI polypeptide as 
discussed above), preparative disc-gel electrophoresis, isoelectric focusing, high pressure Uquid 
chromatography (HPLC), reversed-phase HPLC, gel filtration, cation and anion exchange and 
partition chromatography, and countercurrent distribution. Such purification methods are well 
knovra in the art and are disclosed, e.g., in ""Guide to Protein Purification". Methods in 
Enzvmology , Vol. 182, M. Deutscher, Ed., 1990, Academic Press, New York, NY. 

Purification steps can be followed by performance of assays for receptor binding activity as 
described below. Particularly where an NPCILI polypeptide is being isolated from a cellular or 
tissue source, it is preferable to include one or more inhibitors of proteolytic enzymes in the assay 
system, such as phenylmethanesulfonyl fluoride (PMSF), Pefabloc SC, pepstatin, leupeptin, 
chymo statin and EDTA. 

Antibody Molecules 
Antigenic (including immunogenic) fragments of the NPCILI polypeptides of the 
invention are within the scope of the present invention (e.g., 42 or more contiguous amino acids 
from SEQ ID NO: 2, 4 or 12). The antigenic peptides may be useful, inter alia, for preparing 
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antibody molecules which recognize NPCILI. Anti-NPCILI antibody molecules are useful 

NPCILI antagonists. 

An antigen is any molecule that can bind specifically to an antibody. Some antigens 
cannot, by themselves, eUcit antibody production. Those that can induce antibody production are 
immunogens. 

Preferably, anti-NPClLl antibodies recognize an antigenic peptide comprising an amino 
acid sequence selected from SEQ ID NOs: 39-42 (e.g., an antigen derived from rat NPCILI). 
More preferably, die antibody is A0715, A0716, A0717, A0718, A0867, A0868, A1801 or A1802. 

The term "antibody molecule " includes, but is not limited to, antibodies and fragments 
(preferably antigen-binding fragments) thereof. The term includes monoclonal antibodies, 
polyclonal antibodies, bispecific antibodies. Fab antibody fragments, F(ab)2 antibody fragments, Fv 
antibody fragments (e.g., Vh or Vl), single chain Fv antibody fragments and dsFv antibody 
fragments. Furthermore, the antibody molecules of the invention may be fully human antibodies, 
mouse antibodies, rat antibodies, rabbit antibodies, goat antibodies, chicken antibodies, humanized 
antibodies or chimeric antibodies. 

Although it is not always necessary, when NPCILI polypeptides are used as antigens to 
elicit antibody production in an immunologically competent host, smaller antigenic fragments are, 
preferably, first rendered more immunogenic by cross-linking or concatenation, or by coupling to 
an immunogenic carrier molecule (i.e., a macromolecule having the property of independentiy 
eliciting an immunological response in a host animal, such as diptheria toxin or tetanus). Cross- 
linking or conjugation to a carrier molecule may be required because small polypeptide fragments 
sometimes act as haptens (molecules which are capable of specifically binding to an antibody but 
incapable of eliciting antibody production, i.e., they are not immunogenic). Conjugation of such 
fragments to an immunogenic carrier molecule renders them more immunogenic through what is 
commonly known as the "carrier effect". 

Carrier molecules include, e.g„ proteins and natural or synthetic polymeric compounds 
such as polypeptides, polysaccharides, lipopoly saccharides etc. Protein carrier molecules are 
especially preferred, including, but not limited to, keyhole limpet hemocyanin and mammalian 
serum proteins such as human or bovine gammaglobulin, human, bovine or rabbit serum albumin, 
or methylated or other derivatives of such proteins. Other protein carriers will be apparent to those 
skilled in the art. Preferably, the protein carrier will be foreign to the host animal in which 
antibodies against the fragments are to be elicited. 

Covalent coupling to the carrier molecule can be achieved using methods well known in 
the art, the exact choice of which will be dictated by the nature of the carrier molecule used. When 
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the immunogenic carrier molecule is a protein, the fragments of the invention can be coupled, e.g.^ 
using water-soluble carbodiimides such as dicyclohexylcarbodiimide or glutaraldehyde. 

Coupling agents, such as these, can also be used to cross-link the fragments to themselves 
without the use of a separate carrier molecule. Such cross-linking into aggregates can also increase 
immunogenicity. Immunogenicity can also be increased by the use of known adjuvants, alone or 
in combination with coupling or aggregation. 

Adjuvants for the vaccination of animals include, but are not limited to. Adjuvant 65 
(containing peanut oil, mannide monooleate and aluminum monostearate); Freunds complete or 
incomplete adjuvant; mineral gels such as aluminum hydroxide, aluminum phosphate and alum; 
surfactants such as hexadecylamine, octadecylamine, lysolecithin, dimethyldioctadecylammonium 
bromide, N,N-dioctadecyl-N*,N'-bis(2-hydroxymethyl) propanediamine, 

methoxyhexadecylglycerol and pluronic polyols; polyanions such as pyran, dextran sulfate, poly 
IC, polyacrylic acid and carbopol; peptides such as muramyl dipeptide, dimethylglycine and 
tuftsin; and oil emulsions. The polypeptides could also be administered following incorporation 
into liposomes or other microcarriers. 

Information concerning adjuvants and various aspects of immunoassays are disclosed, e.g., 
in the series by P. Tijssen. Practice and Theory of Enzvme Lmnunoassavs , 3rd Edition, 1987, 
Elsevier, New York. Other useful references covering methods for preparing polyclonal antisera 
include Microbiology , 1969, Hoeber Medical Division, Harper and Row; Landsteiner, Specificity 
of Serological Reactions , 1962, Dover PubUcations, New York, and Williams, et aL, Methods in 
Tmrnunol ogv and hnmunochemistrv . Vol. 1, 1967, Academic Press, New York. 

The anti-NPClLl antibody molecules of the invention preferably recognize human, mouse 
or rat NPCILI; however, the present invention includes antibody molecules which recognize 
NPCILI from any species, preferably manamals (e.g., cat, sheep or horse). The present invention 
also includes complexes comprising an NPCILI polypeptide of the invention and an anti-NPClLl 
antibody molecule. Such complexes can be made by simply contacting the antibody molecule with 
its cognate polypeptide. 

Various methods may be used to make the antibody molecules of the invention. Human 
antibodies can be made, for example, by methods which are similar to those disclosed in U.S. 
Patent Nos. 5,625,126; 5,877,397; 6,255,458; 6,023,010 and 5,874,299. 

Hybridoma cells which produce the monoclonal anti-NPClLl antibodies may be produced 
by methods which are commonly known in the art. These methods include, but are not Umited to, 
the hybridoma technique originally developed by Kohler, et aL, (1975) (Nature 256:495-497), as 
well as the trioma technique (Hering, et aL, (1988) Biomed. Biochim. Acta. 47:211-216 and 
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Hagiwara, et al., (1993) Hum. Antibod. Hybridomas 4:15), the human B-cell hybridoma technique 
(Kozbor, et al, (1983) Immunology Today 4:72 and Cote, et al, (1983) Proc. Natl. Acad. Sci. 
U.S.A 80:2026-2030), and the EB V-hybridoma technique (Cole, et al, in Monoclonal Antibodies 
and Cancer Therapy , Alan R. Liss, Inc., pp. 77-96, 1985). ELISA may be used to determine if 
5 hybridoma cells are expressing anti-NPClLl antibodies. 

The anti-NFC ILl antibody molecules of the present invention may also be produced 
recombinandy (e.g., in an Kcoli/Tl expression system as discussed above). In this embodiment, 
nucleic acids encoding the antibody molecules of the mvention (e.g., Vh or Vl) may be inserted 
into a pet-based plasmid and expressed in the E.coliJTl system. There are several methods by 
10 which to produce recombinant antibodies which are known in the art. An example of a method for 
recombinant production of antibodies is disclosed in U.S. Patent No. 4,816,567. See also Skerra, 
A., etaL, (1988) Science 240:1038-1041; Better, M., etaL, (1988) Science 240:1041-1043 and 
Bird, R.E., etal, (1988) Science 242:423-426. 

The term "monoclonal antibody," includes an antibody obtained from a population of 
15 substantially homogeneous antibodies, Le,, the individual antibodies comprising the population are 
identical except for possible, naturally occurring mutations that may be present in minor amounts. 
Monoclonal antibodies are highly specific, being directed against a single antigenic site. 
Monoclonal antibodies are advantageous in that they may be synthesized by a hybridoma culture, 
essentially uncontaminated by other immunoglobulins. The modifier "monoclonal" indicates the 
20 character of the antibody as being obtained from a substantially homogeneous population of 
antibodies, and is not to be construed as requiring production of the antibody by any particular 
method. The monoclonal antibodies to be used in accordance with the present invention may be 
made by the hybridoma method as described by Kohler, et aL^ (1975) Nature 256:495. 

The term "polyclonal antibody" includes an antibody which was produced among or in the 
25 presence of one or more other, non-identical antibodies. In general, polyclonal antibodies are 
produced from a B -lymphocyte in the presence of several other B-lymphocytes which produced 
non-identical antibodies. Typically, polyclonal antibodies are obtained directiy from an 
immunized animal (e.g., a rabbit). 

A "bispecific antibody" comprises two different antigen binding regions which bind to 
30 distinct antigens. Bispecific antibodies, as well as methods of making and using the antibodies, 
are conventional and very well known in the art. 

Anti-idiotypic antibodies or anti-idiotypes are antibodies directed against the antigen- 
combining region or variable region (called the idiotype) of another antibody molecule. As 
disclosed by Jeme (Jeme, N. K., (1974) Ann, Immunol. (Paris) 125c:373 and Jeme, N. K., et al. 
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(1982) EMBO 1:234), immunization with an antibody molecule expressing a paratope (antigen- 
combining site) for a given antigen (e.g., NPCILI) will produce a group of anti-antibodies, some 
of which share, with the antigen, a complementary structure to the paratope. Immunization with a 
subpopulation of the anti-idiotypic antibodies will, in turn, produce a subpopulation of antibodies 
or immune cell subsets that are reactive to the initial antigen. 

The term ''fully human antibody" refers to an antibody which comprises human 
immunoglobulin sequences only. Similarly, "mouse antibody" refers to an antibody which 
comprises mouse immunoglobulin sequences only. 

"Human/mouse chimeric antibody" refers to an antibody which comprises a mouse variable 
region (Vh and Vl) fused to a human constant region. 

"Humanized" anti-NPClLl antibodies are also within the scope of the present invention. 
Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, which 
contain minimal sequence derived from non-human inmiunoglobulin. For the most part, 
humanized antibodies are human inamunoglobulins (recipient antibody) in which residues from a 
complementary determining region of the recipient are replaced by residues from a complementary 
determining region of a nonhuman species (donor antibody), such as mouse, rat or rabbit, having a 
desired specificity, affinity and capacity. In some instances, Fv framework residues of the human 
immunoglobulin are also replaced by corresponding non-human residues. 

"Single-chain Fv" or "sFv" antibody fragments' include the Vh and/or Vl domains of an 
antibody, wherein these domains are present in a single polypeptide chain. Generally, the sFv 
polypeptide further comprises a polypeptide linker between the Vh and Vl domains which enables 
the sFv to form the desired structure for antigen binding. Techniques described for the production 
of single chain antibodies (U.S. Patent Nos. 5,476,786; 5,132,405 and 4,946,778) can be adapted to 
produce anti-NPClLl specific, single ch^n antibodies. For a review of sFv see Pluckthun in The 
Phamiacologv of Monoclonal Antibodies , vol. 113, Rosenburg and Moore eds. Springer-Verlag, 
N.Y., pp. 269-315 (1994). 

"Disulfide stabilized Fv fragments" and "dsFv" include molecules having a variable heavy 
chain (Vh) and/or a variable light chain (Vl) which are linked by a disulfide bridge. 

Antibody fragments within the scope of the present invention also include F(ab)2 fragments 
which may be produced by enzymatic cleavage of an IgG by, for example, pepsin. Fab fragments 
may be produced by, for example, reduction of F(ab)2 with dithiothreitol or mercaptoethylamine. 
An Fv fragment is a Vl or Vh region. 

Depending on the amino acid sequences of the constant domain of their heavy chains, 
inmiunoglobulins can be assigned to different classes. There are at least five major classes of 
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immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be further divided into 
subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3 and IgG-4; IgA-1 and IgA-2. 

The anti-NPClLl antibody molecules of the invention may also be conjugated to a 
chemical moiety. The chemical moiety may be, inter alia, a polymer, a radionuclide or a cytotoxic 
5 factor. Preferably, the chemical moiety is a polymer which increases the half-life of the antibody 
molecule in the body of a subject. Suitable polymers include, but are by no means limited to, 
polyethylene glycol (PEG) {e.g., PEG with a molecular weight of 21dDa, 5 kDa, 10 kDa, 12kDa, 20 
kDa, 30kDa or 40kDa), dextran and monomethoxypolyethylene glycol (mPEG). Methods for 
producing PEGylated anti-IL8 antibodies which are described in U.S. Patent No. 6,133,426 can be 
10 applied to the production of PEGylated anti-NPClLl antibodies of the invention. Lee, et al, 

(1999) (Bioconj. Chem. 10:973-981) discloses PEG conjugated single-chain antibodies. Wen, et 
ah, (2001) (Bioconj. Chemu 12:545-553) discloses conjugating antibodies with PEG which is 
attached to a radiometal chelator (diethylenetriaminpentaacetic acid (DTP A)). 

The antibody molecules of the invention may also be conjugated with labels such as 
15 ^^Tc^^Y, ^^^In, ^^C, ^^I, 'H, ^^^I, '^C, ^^O, ^^N, ^^F, ^^S, ^^Cr, ^^To, ^^Ra, ^^Co, ^^Fe, ^^Se, ^^^Eu, 
^^CU, ^'^Ci, ^^'At, ^^¥b, ^^Sc, ^*^^Pd, ^^Th, ^^K, '^'Gd, ^^Mn, ^^r or '^Fe. 

The antibody molecules of the invention may also be conjugated with fluorescent or 
chemilluminescent labels, including fluorophores such as rare earth chelates, fluorescein and its 
derivatives, rhodamine and its derivatives, isothiocyanate, phycoerythrin, phycocyanin, 
20 allophycocyanin, o-phthaladehyde, fluorescamine, ^^^Eu, dansyl, umbelUferone, luciferin, luminal 
label, isoluminal label, an aromatic acridinium ester label, an imidazole label, an acridinaium salt 
label, an oxalate ester label, an aequorin label, 2,3-dihydrophthalazinediones, biotin/avidin, spin 
labels and stable free radicals. 

The antibody molecules may also be conjugated to a cytotoxic factor such as diptheria 
25 toxin, Pseudomonas aeruginosa exotoxin A chain , ricin A chain, abrin A chain, modeccin A 

chain, alpha-sarcin, Aleurites fordii proteins and compounds {e.g., fatty acids), dianthin proteins, 
Phytolacca americana proteins PAPI, PAPE, and PAP-S, momordica charantia inhibitor, curcin, 
crotin, saponaria officinalis inhibitor, mitogellin, restrictocin, phenomycin, and enomycin. 

Any method known in the art for conjugating the antibody molecules of the invention to the 
30 various moieties may be employed, including those methods described by Hunter, et al., (1962) 
Nature 144:945; David, et al., (1974) Biochemistry 13:1014; Pain, et al., (1981) J. Immunol. 
Meth. 40:219; and Nygren, J., (1982) Histochem. and Cytochem. 30:407. 

Methods for conjugating antibodies are conventional and very well known in the art 
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Screening Assays 

The invention allows the discovery of selective agonists and antagonists of NPCILI (e.g., 
SEQ ID NO: 2, 4 or 12) that may be useful in treatment and management of a variety of medical 
conditions including elevated serum cholesterol. Thus, NPCILI of this invention can be employed 

5 in screening systems to identify agonists or antagonists. Essentially, these systems provide 
methods for bringing together NPCILI, an appropriate, known ligand or agonist or antagonist, 
including cholesterol, ezetimibe, BODBPY-ezetimibe (Altmann, et aL, (2002) Biochim. Biophys. 
Acta 1580(l):77-93) or 4", 6"-bis[(2-fluorophenyl)carbamoyl]-beta-D-cellobiosyl derivative of 11- 
ketotigogenin as described in DeNinno, et aL, (1997) (J. Med. Chem. 40(16):2547-54) (Merck; L- 

10 166,143), and a sample to be tested for the presence of an NPCILI agonist or antagonist. A 
convenient method by which to evaluate whether a sample contains an NPCILI agonist or 
antagonist is to determine whether the sample contains a substance which competes for binding 
between the known agonist or antagonist (e.g., ezetimibe) and NPCILI. 

Ezetimibe can be prepared by a variety of methods well know to those skilled in the art, for 

15 example such as are disclosed in U.S. Patents Nos. 5,631,365, 5,767,1 15, 5,846,966, 6,207,822, 

U.S. Patent Application Publication No. 2002/0193607 and PCT Patent Application WO 93/02048, 
each of which is incorporated herein by reference in its entirety. 

^'Sample", "candidate compound" or "candidate substance" refers to a composition which 
is evaluated in a test or assay, for example, for the ability to agonize or antagonize NPCILI (e.g., 

20 SEQ ID NO: 2, 4 or 12) or a functional fragment thereof. The composition may small molecules, 
peptides, nucleotides, polynucleotides, subatomic particles (e.g., a particles, P particles) or 
antibodies. 

Two basic types of screening systems can be used, a labeled-ligand binding assay (e.g., 
direct binding assay or scintillation proximity assay (SPA)) and a "cholesterol uptake" assay. A 

25 labeled ligand for use in the binding assay can be obtained by labeling cholesterol or a known 

NPCILI agonist or antagonist with a measurable group (e.g., ^^^1 or ^H). Various labeled forms of 
cholesterol are available commercially or can be generated using standard techniques (e.g.. 
Cholesterol- [1,2-^H(N)], Cholesterol-[l,2,6,7-^H(N)] or Cholesterol-[7-^H(N)]; American 
Radiolabeled Chemicals, Inc; St. Louis, MO). In a preferred embodiment, ezetimibe is 

30 fluorescentiy labeled with a BODIPY group (Altmann, et aL, (2002) Biochim. Biophys. Acta 
1580(l):77-93) or labeled with a detectable group such as ^^^I or ^H. 

Direct Biding Assay. Typically, a given amount of NPCILI of the invention (e.g., SEQ ID 
NO: 2, 4 or 12) is contacted with increasing amounts of labeled ligand or known antagonist or 
agonist (discussed above) and the amount of the bound, labeled ligand or known antagonist or 
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agonist is measured after removing unbound, labeled ligand or known antagonist or agonist by 
washing. As the amount of the labeled Ugand or known agonist or antagonist is increased, a point 
is eventually reached at which all receptor binding sites are occupied or saturated. Specific 
receptor binding of the labeled ligand or known agonist or antagonist is abolished by a large excess 
of unlabeled ligand or known agonist or antagonist. 

Preferably, an assay system is used in which non-specific binding of the labeled ligand or 
known antagonist or agonist to the receptor is minimal. Non-specific binding is typically less than 
50%, preferably less than 15%, and more preferably less than 10% of the total binding of the 
labeled ligand or known antagonist or agonist. 

A nucleic acid encoding an NPCILI polypeptide of the invention (e.g., SEQ ID NO: 2, 4 
or 12) can be transfected into an appropriate host cell, whereby the receptor will become 
incorporated into the membrane of the cell. A membrane fraction can then be isolated from the cell 
and used as a source of the receptor for assay. Altematively, the whole cell expressing the receptor 
in the cell surface can be used in an assay. Preferably, specific binding of the labeled ligand or 
known antagonist or agonist to an untransfected/untransformed host cell or to a membrane fraction 
from an untransfected/untransformed host cell will be negligible. 

In principle, a binding assay of the invention could be carried out using a soluble NPCILI 
polypeptide of the invention, e.g., following production and refolding by standard methods from an 
E. coli expression system, and the resulting receptor-labeled ligand complex could be precipitated, 
e.g., using an antibody against the receptor. The precipitate could then be washed and the amount 
of the bound, labeled ligand or antagonist or agonist could be measured. 

In the basic binding assay, the method for identifying an NPCILI agonist or antagonist 
includes: 

(a) contacting NPCILI (e.g., SEQ ID NO: 2 or 4 or 12) or a subsequence thereof, 
in the presence of a known amount of labeled cholesterol or known antagonist or agonist 
{e.g., labeled ezetimibe or labeled L-166,143) with a sample to be tested for the presence of 
an NPCILI agonist or antagonist; and 

(b) measuring the amount of labeled cholesterol or known antagonist or agonist 
bound to the receptor. 

An NPCILI antagonist or agonist in the sample is identified by measuring substantially 
reduced binding of the labeled cholesterol or known antagonist or agonist to NPCILI, compared to 
what would be measured in the absence of such an antagonist or agonist. For example, reduced 
binding between [^H]-cholesterol and NPCILI in the presence of a sample might suggest that the 
sample contains a substance which is competing against [^H] -cholesterol for NPCILI binding. 
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Altematively, a sample can be tested directly for binding to NPCILI (e.g., SEQ ID NO: 2, 
4 or 12). A basic assay of this type may include the following steps: 

(a) contacting NPCILI (e.g., SEQ ID NO: 2 or 4 or 12) or a subsequence thereof with a 
labeled candidate compound (e.g., [^H]-ezetimibe); and 

(b) detecting binding between the labeled candidate compound and NPCILI. 

A candidate compound which is found to bind to NPCILI may function as an agonist or 
antagonist of NPCILI (e.g., by inhibition of cholesterol uptake). 

SPA Assay. NPCILI antagonists or agonists may also be measured using scintillation 
proximity assays (SPA). SPA assays are conventional and very well known in the art; see, for 
example, U.S. Patent No. 4,568,649. In SPA, the target of interest is immobilised to a small 
microsphere approximately 5 microns in diameter. The microsphere, typically, includes a soUd 
scintillant core which has been coated with a polyhydroxy film, which in tum contains coupling 
molecules, which allow generic links for assay design. When a radioisotopically labeled molecule 
binds to the microsphere, the radioisotope is brought into close proximity to the scintillant and 
effective energy transfer from electrons emitted by the isotope will take place resulting in the 
emission of light. While the radioisotope remains in free solution, it is too distant from the 
scintillant and the electron will dissipate the energy into the aqueous medium and therefore remain 
undetected. Scintillation may be detected with a scintillation counter. In general, and ^^^I labels 
are well suited to SPA. 

For the assay of receptor-mediated binding events, the lectin wheat germ agglutinin (WGA) 
may be used as the SPA bead coupling molecule (Amersham Biosciences; Piscataway, NJ). The 
WGA coupled bead captures glycosylated, cellular membranes and glycoproteins and has been 
used for a wide variety of receptor sources and cultured cell membranes. The receptor is 
immobilized onto the WGA-SPA bead and a signal is generated on binding of an isotopically 
labeled ligand. Other coupling molecules which may be useftil for receptor binding SPA assays 
include poly-L-lysine and WGA/polyethyleneimine (Amersham Biosciences; Piscataway, NJ). 
See, for example. Berry, J.A., et a/., (1991) Cardiovascular Pharmacol. 17 (Suppl.7): S143-S145; 
Hoffinan, R., et al., (1992) Anal. Biochem. 203: 70-75; Kienhus, et al., (1992) J. Receptor 
Research 12: 389-399; Jing, S., et a/., (1992) Neuron 9: 1067-1079. 

The scintillant contained in SPA beads may include, for example, yttrium silicate (YSi), 
yttrium oxide (YOx), diphenyloxazole or polyvinyltoluene (PVT) which acts as a soUd solvent for 
diphenylanthracine (DP A). 

SPA assays may be used to analyze whether a sample is an NPCILI antagonist or agonist. 
In these assays, a host cell which expresses NPCILI (e.g., SEQ ID NO: 2 or 4 or 12) on die cell 



wo 2004/009772 PCT/US2003/022467 

-24- 

surface or a membrane firaction thereof is incubated with SPA beads (e.g., WGA coated YOx beads 
or WGA coated YSi beads) and labeled, known ligand or agonist or antagonist {e.g., H- 
cholesterol, ^H-ezetimibe or ^^^I-ezetimibe). The assay mixture further includes either the sample 
to be tested or a blank {e.g., water). After an optional incubation, scintillation is measured using a 
scintillation counter. An NPCILI agonist or antagonist may be identified in the sample by 
measuring substantially reduced fluorescence, compared to what would be measured in the absence 
of such agonist or antagonist (blank). Measuring substantially reduced fluorescence may suggest 
that the sample contains a substance which competes for NPCILI binding with the known ligand, 

agonist or antagonist 

Altematively, a sample may be identified as an antagonist or agonist of NPCILI by 
direcfly detecting binding in a SPA assay. In this assay, a labeled version of a candidate compound 
to be tested may be put in contact with the host cell expressing NPCILI or a membrane fraction 
thereof which is bound to the SPA bead. Huorescence may then be assayed to detect the presence 
of a complex between the labeled candidate compound and the host cell or membrane fraction 
expressing NPCILI. A candidate compound which binds to NPCILI may possess NPCILI 
agonistic or antagonistic activity. 

Host cells expressing NPCILI may be prepared by transforming or transfecting a nucleic 
acid encoding an NPCILI of the invention into an appropriate host cell, whereby the receptor 
becomes incorporated into the membrane of the cell. A membrane fraction can then be isolated 
from the cell and used as a source of the receptor for assay. Altematively, the whole cell 
expressing the receptor on the cell surface can be used in an assay. Preferably, specific binding of 
the labeled ligand or known antagonist or agonist to an untransfected/untransfonned host cell or 
membrane fraction from an untransfected/untransformed host cell will be negligible. Preferred 
host cells include Chinese Hamster Ovary (CHO) cells, murine macrophage ill A cells or any other 
macrophage cell line and human intestinal epitheUal Caco2 cells. 

Cholesterol Uptake Assay. Assays may also be performed to determine if a sample can 
agonize or antagonize NPCILI mediated cholesterol uptake. In these assays, a host cell expressing 
NPCILI {e.g., SEQ ID NO: 2 or 4 or 12) on the cell surface (discussed above) can be contacted 
with detectably labeled cholesterol {e.g., ^H-cholesterol or *^I-cholesterol) along with either a 
sample or a blank. After an optional incubation, the cells can be washed to remove unabsorbed 
cholesterol. Cholesterol uptake can be determined by detecting the presence of labeled cholesterol 
in the host cells. For example, assayed cells or ly sates or fractions thereof {e.g., fractions resolved 
by thin4ayer chromatography) can be contacted with a liquid scintillant and scintillation can be 
measured using a scintillation counter. 
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In these assays, an NPCILI antagonist in the sample may be identified by measuring 
substantially reduced uptake of labeled cholesterol (e.g., ^H-cholesterol), compared to what would 
be measured in the absence of such an antagonist and an agonist may be identified by measuring 
substantially increased uptake of labeled cholesterol (e.g., ^H-cholesterol), compared to what 
would be measured in the absence of such an agonist. 

Pharmaceutical Compositions 

NPCILI agonists and antagonists discovered, for example, by the screening methods 
described above may be used therapeutically (e.g., in a pharmaceutical composition) to stimulate or 
block the activity of NPCILI and, thereby, to treat any medical condition caused or mediated by 
the receptors. For example, the antibody molecules of the invention may also be used 
therapeutically (e.g., in a pharmaceutical composition) to bind NPCILI and, thereby, block the 
ability of the receptor to bind cholesterol. Blocking the binding of the cholesterol may prevent 
absorption of the molecule (e.g., by intestinal cells such as enterocytes). Blocking absorption of 
cholesterol may be a useful way to lower serum cholesterol levels in a subject and, thereby, reduce 
the incidence of, for example, hyperlipidemia, atherosclerosis, coronary heart disease, stroke or 
arteriosclerosis. 

The term "subject" or "patient" includes any organism, preferably animals, more preferably 
mammals (e.g., mice, rats, rabbits, dogs, horses, primates, cats) and most preferably humans. 

The term "pharmaceutical composition" refers to a composition including an active 
ingredient and a pharmaceutically acceptable carrier and/or adjuvant. 

Although the compositions of this invention could be administered in simple solution, they 
are more typically used in combination with other materials such as carriers, preferably 
pharmaceutically acceptable carriers. Useful, pharmaceutically acceptable carriers can be any 
compatible, non-toxic substances suitable for delivering the compositions of the invention to a 
subject. Sterile water, alcohol, fats, waxes, and inert solids may be included in a pharmaceutically 
acceptable carrier. Pharmaceutically acceptable adjuvants (buffering agents, dispersing agents) 
may also be incorporated into the pharmaceutical composition. 

Preferably, the pharmaceutical compositions of the invention are in the form of a pill or 
capsule. Methods for formulating pills and capsules are very well known in the art. For example, 
for oral administration in the form of tablets or capsules, the active drug component may be 
combined with any oral, non-toxic pharmaceutically acceptable inert carrier, such as lactose, 
starch, sucrose, cellulose, magnesium stearate, dicalcium phosphate, calcium sulfate, talc, 
mannitol, ethyl alcohol (liquid forms) and the like. Moreover, when desired or needed, suitable 
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binders, lubricants, disintegrating agents and coloring agents may also be incorporated in the 
mixture. Suitable binders include starch, gelatin, natural sugars, com sweeteners, natural and 
synthetic gums such as acacia, sodium alginate, carboxymethylcellulose, polyethylene glycol and 
waxes. Among the lubricants there may be mentioned for use in these dosage forms, boric acid, 
sodium benzoate, sodium acetate, sodium chloride, and the Uke. Dismtegrants include starch, 
methylcellulose, guar gum and the like. Sweetening and flavoring agents and preservatives may 
also be included where appropriate. 

The pharmaceutical compositions of the invention may be administered in conjunction with 
a second pharmaceutical composition or substance. In preferred embodiments, the second 
composition includes a cholesterol-lowering drug. When a combination therapy is used, both 
compositions may be formulated into a single composition for simultaneous delivery or formulated 
separately into two or more compositions (e.g., a kit). 

The formulations may conveniently be presented in unit dosage form and may be prepared 
by any methods well known in the art of pharmacy. See, e,g,. Oilman et aL (eds.) (1990), The 
Pharmacological Bases of Therapeutics , 8th Ed., Pergamon Press; and Remington's Pharmaceutical 
Sciences , supra, Easton, Penn.; Avis et aL (eds.) (1993) Pharmaceutical Dosage Forms: Parenteral 
Medications Dekker, New York; Lieberman et aL (eds.) (1990) Pharmaceutical Dosage Forms; 
Tablets Dekker, New York; and Lieberman et aL (eds.) (1990), Pharmaceutical Dosage Forms: 
Disperse Systems Dekker, New York. 

The dosage regimen involved in a therapeutic application may be determined by a 
physician, considering various factors which may modify the action of the therapeutic substance, 
e.g,, the condition, body weight, sex and diet of the patient, the severity of any infection, time of 
administration, and other clinical factors. Often, treatment dosages are titrated upward from a low 
level to optimize safety and efficacy. Dosages may be adjusted to account for the smaller 
molecular sizes and possibly decreased half-lives (clearance times) following administration. 

An "effective amount" of an antagonist of the invention may be an amount that will 
detectably reduce the level of intestinal cholesterol absorption or detectably reduce the level of 
serum cholesterol in a subject administered the composition. 

Typical protocols for the therapeutic administration of such substances are well known in 
the art. Pharmaceutical composition of the invention may be administered, for example, by any 
parenteral or non-parenteral route. 

Pills and capsules of the invention can be administered orally. Injectable compositions can 
be administered with medical devices known in the art; for example, by injection with a 
hypodermic needle. 



wo 2004/009772 



PCT/US2003/022467 



-27- 

Injectable pharmaceutical compositions of the invention may also be administered with a 
needleless hypodermic injection device; such as the devices disclosed in U.S. Patent Nos. 
5,399,163; 5,383,851; 5,312,335; 5,064,413; 4,941,880; 4,790,824 or 4,596,556. 

Anti-Sense 

The present invention also encompasses anti-sense oligonucleotides capable of specifically 
hybridizing to mRNA encocHng NPCILI (e.g., any of SEQ ID NOs: 1, 3, 5-1 1 or 13) having an 
amino acid sequence defined by, for example, SEQ ID NO: 2 or 4 or 12 or a subsequence thereof 
so as to prevent translation of the mJElNA. Additionally, this invention contemplates anti-sense 
oligonucleotides capable of specifically hybridizing to the genomic DNA molecule encoding 
NPCILI, for example, having an amino acid sequence defined by SEQ ID NO: 2 or 4 or 12 or a 
subsequence thereof. 

This invention further provides pharmaceutical compositions comprising (a) an amount of 
an oligonucleotide effective to reduce NPCILI -mediated cholesterol absorption by passing through 
a cell membrane and binding specifically with mRNA encoding NPCILI in the cell so as to 
prevent its translation and (b) a phamiaceutically acceptable carrier capable of passing through a 
cell membrane. In an embodiment, the oligonucleotide is coupled to a substance that inactivates 
mRNA. In another embodiment, the substance that inactivates mRNA is a ribozyme. 

EXAMPLES 

The following examples are provided to more clearly describe the present invention and 
should not be construed to limit the scope of the invention in any way. 

Example 1 ; Cloning and Expression of Rat, Mouse and Human NPCILI. 

Rat JVPC, mouse NPCILI or human NPCILI can all conveniently be amplified using 
polymerase chain reaction (PCR). In this approach, DNA from a rat, mouse or human cDNA 
library can be amplified using appropriate primers and standard PCR conditions. Design of 
primers and optimal amplification conditions constitute standard techniques which are commonly 
known in the art. 

An amplified NPCILI gene may conveniently be expressed, again, using methods which 
are commonly known in the art. For example, NPCILI may be inserted into a pET-based plasmid 
vector (Stratagene; La Joola, CA), downstream of the T7 RNA polymerase promoter. The plasmid 
may then be transformed into a T7 expression system (e.g., BL21DE3 E.coli cells), grown in a 
liquid culture and induced (e.g., by adding IPTG to the bacterial culture). 
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Example 2 : Direct Binding Assay. 

Membrane preparation : Caco2 cells transfected with an expression vector containing a 
polynucleotide encoding NPCILI (e.g., SEQ ID NO: 2, 4 or 12) are harvested by incubating in 5 

5 mM EDTA/phosphate-buffered saline followed by repeated pipeting. The cells are centrifuged 5 
min at 1000 x g. The EDTA/PBS is decanted and an equal volume of ice-cold 50mM Tris-HCl, pH 
7.5 is added and cells are broken up with a Polytron (PTIO tip, setting 5, 30 sec). Nuclei and 
unbroken cells are sedimented at 1000 x g for 10 min and then the supematant is centrifuged at 
50,000 X g for 10 min. The supematant is decanted, the pellet is resuspended by Polytron, a sample 

10 is taken for protein assay (bicinchoninic acid. Pierce), and the tissue is again centrifuged at 50,000 
X g. Pellets are stored frozen at -20°C. 

Binding assay : For saturation binding, four concentrations of [ H]-ezetimibe (15 Ci/mmol) 
are incubated without and with 10"^ M ezetimibe in triplicate with 50 [ig of membrane protein in a 
total volume of 200 fil of 50 mM Tris-HCl, pH 7.5, for 30 min at 30°C. Samples are filtered on 

15 GF/B filters and washed three times with 2 ml of cold Tris buffer. Filters are dried in a microwave 

oven, impregnated with Meltilex wax scintillant, and counted at 45% efficiency. For competition 

'J 

binding assays, five concentrations of a sample are incubated in triplicate with 18 nM [ H]- 
ezetimibe and 70 fig of membrane protein under the conditions described above. Curves are fit to 
the data with Prism (GraphPad Software) nonlinear least-squares curve-fitting program and Ki 
20 values are derived firom IC50 values according to Cheng and Prusoff (Cheng, Y. C, et aL, (1973) 
Biochem. Pharmacol. 22:3099-3108). 

Example 3 : SPA Assay, 

For each well of a 96 well plate, a reaction mixture of 10 fxg human, mouse or rat NPCILI- 

25 CHO overexpressing membranes (Biosignal) and 200 jLig/well YSi-WGA-SPA beads (Amersham) 
in 100 |Lil is prepared in NPCILI assay buffer (25 mM HEPES, pH 7.8, 2 mM CaCl2, ImM MgCl2, 
125 mM NaCl, 0.1% BSA). A 0.4 nM stock of ligand- [^^^I]-ezetimibe- is prepared in the NPCILI 
assay buffer. The above solutions are added to a 96-well assay plate as follows: 50 |Uil NPCILI 
assay buffer, 100 |jl1 of reaction mixture, 50 \x\ of ligand stock (final ligand concentration is 0.1 

30 nM). The assay plates are shaken for 5 minutes on a plate shaker, then incubated for 8 hours 
before cpm/well are determined in Microbeta Trilux counter (PerkinElmer). 

These assays will indicate that [^^^I]-ezetimibe binds to the cell membranes expressing 
human, mouse or rat NPCILI. Similar results will be obtained if the same experiment is 
performed with radiolabeled cholesterol {e.g., I-cholesterol). 
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Example 4 : Cholesterol Uptake Assay. 

CHO cells expressing either SR-Bl or three different clones of rat NPCILI or one clone of 
mouse NPCILI were starved overnight in cholesterol free media then dosed with [^H]-cholesterol 
in a mixed synthetic micelle emulsion for 4 min, 8 min ,12 min or 24 min in the absence or 
presence of 10 lUiM ezetimibe. The cells were harvested and the lipids were organically extracted. 
The extracted lipids were spotted on thin-layer chromatography (TLC) plates and resolved within 
an organic vapor phase. The free cholesterol bands for each assay were isolated and counted in a 
scintillation counter. 

The SR-Bl expressing cells exhibited an increase in [^H] -cholesterol uptake as early as 4 
min which was also inhibited by ezetimibe. The three rat clones and the one mouse clone appeared 
to give background levels of [^H] -cholesterol uptake which was similar to that of the 
untransformed CHO cell. 

These experiments will yield data demonstrating that CHO cells can perform mouse, rat 
and human NPClLl-dependent uptake of [^H] -cholesterol when more optimal experimental 
conditions are developed. 

Example 5 ; Expression of Rat NPCILI in Wistar Rat Tissue. 

In these experiments, the expression of rat NPCILI mRNA, in several rat tissues, was 
evaluated. The tissues evaluated were esophagus, stomach, duodenum, jejunum, ileum, proximal 
colon, distal colon, liver, pancreas, heart, aorta, spleen, lung, kidney, brain, muscle, testes, ovary, 
uterus, adrenal gland and thyroid gland. Total RNA samples were isolated from at least 3 male and 
3 female animals and pooled. The samples were then subjected to real time quantitative PCR using 
Taqman analysis using standard dual-labeled fluorogenic oligonucleotide probes. Typical probe 
design incorporated a 5' reporter dye {e.g., 6FAM (6-carboxyfluorescein) or VIC) and a 3' 
quenching dye {e.g.^ TAMRA (6-carboxytetramethyl-rhodamine)). 

idit NPClLh 

Forward: TCTTCACCCTTGCTCTTTGC (SEQ ID NO: 14) 

Reverse: AATGATGGAGAGTAGGTTGAGGAT (SEQ ID NO: 15) 

Probe: [6FAM]TGCCCACCTTTGTTGTCTGCTACC[TAMRA] (SEQ ID NO: 16) 

rat p-actin : 

Forward: ATCGCTGACAGGATGCAGAAG (SEQ ID NO: 17) 
Reverse: TCAGGAGGAGCAATGATCTTGA (SEQ ID NO: 18) 
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Probe: [VIC]AGATTACTGCCCTGGCTCCTAGCACCAT[TAMRA] (SEQ ID NO: 19) 
PGR reactions were run in 96- well format with 25 |lx1 reaction mixture in each well 

containing: Platinum SuperMix (12.5 jxl), ROX Reference Dye (0.5 ul), 50 mM magnesium 

chloride (2 |li1), cDNA from RT reaction (0.2 fxl). Multiplex reactions contained gene specific 
5 primers at 200 nM each and FAM labeled probe at 100 nM and gene specific primers at 100 nM 

each and VIC labeled probe at 50 nM. Reactions were run with a standard 2-step cycling program, 

95^ C for 15 sec and 60^ C for 1 min, for 40 cycles. 

The highest levels of expression were observed in the duodenum, jejunum and ileum tissue. 

These data indicate that NPCILI plays a role in cholesterol absorption in the intestine. 

10 

Example 6 ; Expression of Mouse NPCILI in Mouse Tissue. 

In these experiments, the expression of mouse NPCILI mRNA, in several tissues, was 
evaluated. The tissues evaluated were adrenal gland, BM, brain, heart, islets of langerhans, LI, 
small intestine, kidney, liver, lung, MLN, PLN, muscle, ovary, pituitary gland, placenta, Peyers 
15 Patch, skin, spleen, stomach, testes, thymus, thyroid gland, uterus and trachea. Total RNA samples 
were isolate from at least 3 male and 3 female animals and pooled. The samples were then 
subjected to real time quantitative PGR using Taqman analysis using the following primers and 
probes: 

mouse iVPCiLi : 

20 Forward: ATCCTCATCCTGGGCTTTGC (SEQ ID NO: 20) 

Reverse: GCAAGGTGATCAGGAGGTTGA (SEQ ID NO: 21) 

Probe: [6FAM]CCCAGCTTATCCAGATTTTCTTCTTCCGC[TAMRA] (SEQ ID NO: 22) 

The highest levels of expression were observed in the Peyer's Patch, small intestine, gall 
bladder and stomach tissue. These data are consistent with a cholesterol absorption role for 
25 NPCILI which takes place in the digestive system. 

Example 7 ; Expression of Human NPCILI in Human Tissue. 

In these experiments, the expression level of human NPCILI mRNA was evaluated in 
2045 samples representing 46 normal tissues. Microarray-based gene expression analysis was 
30 performed on the Affymetrix HG-U95 GeneChip using a cRNA probe corresponding to base pairs 
4192-5117 (SEQ ID NO: 43) in strict accordance to Affymetrix's established protocols. Gene 
Chips were scanned under low photo multiplier tube (PMT), and data were normalized using either 
Affymetrix MAS 4.0 or MAS 5.0 algorithms. In addition ''spike ins" for most samples were used 
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to construct a standard curve and obtain RNA concentration values according Gene Logic 
algorithms and procedures. A summary of these results are indicated, below, in Table 2. 



Table 2. Expression level otNPClLl mRNA in various human tissues. 



5 



Tissue 


Present 


Absent 


Lower 
25% 


Median 


Upper 
75% 




Tissue 


Present 


Absent 


Lower 

25% 


Median 


Upper 

75% 


Adipose 


2 of 32 


30 of 32 


-2.45 


1.16 


12.23 




tlver 


32 of 34 ' 


•"V .2 of 34 • • V ; 


325.74 


427.77 


: 540,1 


Adrenal Oland 


Oof 12 


12 of 12 


-23.54 


-4.47 


10.51 




Lung 


2 of 93 


91 of 93 


-3.47 


11.03 


22.34 


Appendix 


Oof 3 


3 of 3 


-8.02 


-6.69 


38.19 




Lymph Node 


Oof 11 


11 of 11 


-1.78 


-0.19 


1.34 


Artery 


0 of 3 


3 of 3 


-6.59 


-4.67 


9.68 




Muscles 


0 of 39 


39 of 39 


-21.57 


8.25 


26.73 


Bladder 


1 of 5 


4 of 5 


-22 


-7.95 


-1.99 




Myometrium 


8 of 106 


98 of 106 


-3.98 


4.87 


17.55 


Bone 


Oof 3 


3 of 3 


-1.64 


3.3 


19.53 




Omentum 


Oof 15 


15 of 15 


-14.25 


•1.6 


19.58 


Breast 


4 of 80 


76 of 80 


-4.07 


3.13 


14.67 




Ovary 


1 of 74 


73 of 74 


0.5 


17.51 


38.28 


Cerebellum 


0 of 5 


5 of 5 


-3.04 


3.24 


15.38 




Pancreas 


Oof 34 


34 of 34 


-87.08 


-53.2 


-24.14 


Cervix 


3 of 101 


9S of 101 


-7.56 


-0.07 


20.89 




Placenta 


0 of 5 


SofS 


-20.4 


-3.44 


18.91 


Colon 


9 of 151 


142 of 151 


-10.19 


0.31 


18.36 




Prostate 


0 of 32 


32 of 32 


1.08 


15.56 


27.24 


Cortex Frontal Lobe 


Oof 7 


7 of 7 


1.4 


8.46 


11.75 




Rectum 


1 of 43 


42 of 43 


-9.26 


-1.49 


9.8 


Cortex Temooral Lobe 


Oof 3 


3 of 3 


7.1 


8.5 


15.87 




Mght Atrium 


4 of 169 


165 of 169 


-19.32 


•6.58 


7.72 


Duodenum 


59 of 61 


,2of6i ; 


519.23 


827.43 


1101.67 




Right Ventride 


1 of 160 


159 of 160 


-24.01 


-6.49 


10.06 


Endometrium 


Oof 21 


21 of 21 


-14.43 


-6.39 


2.79 




Sktn 


0 of 59 


59 of 59 


-12.68 


1.5 


22.77 


Esophagus 


1 of 27 


26 of 27 


-10.93 


-4.97 


12.48 




SmaUXntestirte 


46 Of 68 


22 of 68 


•:• .. .. .21.21 


493.93 


939.2 


Fattoptan Tube 


3 of 51 


48 of 51 


5.02 


13.24 


26.77 




Soft Tissues 


1 of 6 


5 of 6 


-1.99 


2.6 


5.32 


iGaltBiadder 


8of8 




205.76 


273.39 


- 42Z.B 




Spleen 


Oof 31 


31 of 31 


-9.41 


-0.31 




Heart 


Oof 3 


3 of 3 


3.33 


11.19 


11.66 




Stomach 


Tof47 


40 of 47 


.19,02 


52.29 




Hippocampus 


Oofs 


SofS 


8.25 


9,11 


19.83 




Testis 


0 of 5 


SofS 


-4.51 


102 


11.2 


Kidney 


4 of 86 


82 of 86 


-8.36 


3.41 


16.46 




Thymus 


1 of 71 


70 of 71 


-6.26 


2.51 


11.67 


Larynx 


0 of 4 


4 of 4 


-13.76 


-0.81 


8.54 




Thyroid Gland 


1 of 18 


17 of 18 


-12.2Z 


2.84 


17.86 


Left Atrium 


2 of 141 


139 of 141 


-18.9 


-4.58 


fi.84 




Uterus 


Oof 58 


58 of 58 


-10.67 


1.59 


16.01 


Left Ventride 


Oof 15 


IS of 15 


-21.19 


-9.59 


17.7 




W8C 


3 of 40 


37of40 


-16.45 


-0.72 


25.18 



Shaded data corresponds to tissues wherein the highest levels of NPCILI mRNA was 
detected. The "Present" column indicates the proportion of specified tissue samples evaluated 
wherem NPCILI mRNA was detected. The "Absent" column indicates the proportion of specified 
tissue samples evaluated wherein NPCILI RNA was not detected. The "lower 25%", "median" 
10 and "upper 75%" columns indicate statistical distribution of the relative NPCILI signal intensities 
observed for each set of tissue evaluated. 

Example 8 : Distribution of Rat NPCILI, Rat IBAT or Rat SR-Bl mRNA in Rat Small 
Intestine. 

15 In these experiments, the distribution of rat NPCILI mRNA along the proximal-distal axis 

of rat small intestines was evaluated. Intestines were isolated from five independent animals and 
divided into 10 sections of approximately equal length. Total RNA was isolated and analyzed, by 
real time quantitative PGR using Taqman analysis, for localized expression levels of rat NPCILI^ 
rat IBAT (ileal bile acid transporter) or rat SR-Bl mRNA. The primers and probes used in the 

20 analysis were: 

x^t NPCILI : 

Forward: TCTTCACCCTTGCTCTTTGC (SEQ ID NO: 23) 
Reverse: AATGATGGAGAGTAGGTTGAGGAT (SEQ ID NO: 24) 
25 Probe: [6FAM]TGGGGAGGTTTGTTGTCTGCTACC[TAMRA] (SEQ ID NO: 25) 
rat Villin: 
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Forward: AGCACCTGTCCACTGAAGATTTC (SEQ ID NO: 26) 

Reverse: TGGACGCTGAGCTTCAGTTCT (SEQ ID NO: 27) 

Probe: [VIC]CTTCTCTGCGCTGCCTCGATGGAA[TAMRA] (SEQ ID NO: 28) 

mtSR-Bl : 

5 Forward: AGTAAAAAGGGCTCGCAGGAT (SEQ ID NO: 29) 
Reverse: GGCAGCTGGTGACATCAGAGA (SEQ ID NO: 30) 

Probe: [6FAM]AGGAGGCCATGCAGGCCTACTCTGA[TAMRA] (SEQ ID NO: 31 ) rat IBAT : 
Forward: GAGTCCACGGTCAGTCCATGT (SEQ ID NO: 32) 
Reverse: TTATGAACAACAATGCCAAGCAA (SEQ ID NO: 33) 
10 Probe: [6FAM]AGTCCTTAGGTAGTGGCTTAGTCCCTGGAAGCTC[TAMRA] (SEQ ID NO: 

34) 

The mRNA expression levels of each anunal intestinal section were analyzed separately, 
then the observed expression level was normalized to the observed level of villin mRNA in that 
intestinal section. The observed, normalized mRNA expression levels for each section where then 
15 averaged. 

The expression level of NPCILI and SR-Bl were highest in the jejunmn (sections 2-5) as 
compared to that of the more distal ileum sections. Since the jejunum is believed to be the site of 
cholesterol absorption, these data suggest such a role for rat NPCILI. /BAT distribution favoring 
the ileum is well document and served as a control for the experiment. 

20 

Example 9 ; In situ Analysis of Rat NPCILI mRNA in Rat Jejunum Tissue, 

The localization of rat NPCILI mRNA was characterized by in situ hybridization analysis 
of rat jejunum serial sections. The probes used in this analysis were: 



25 T7-sense probe : GTAATACGACTCACTATAGGGCCCTGACGGTCCTTCCTGA 
GGGAATCTTCAC (SEQ ID NO: 35) 

T7-antisense probe : GTAATACGACTCACTATAGGGCCTGGGAAGTTGGTCAT 
GGCCACTCCAGC (SEQ ID NO: 36) 

The RNA probes were synthesized using T7 RNA polymerase amplification of a PGR 
30 amplified DNA fragment corresponding rat NPCILI nucleotides 3318 to 3672 (SEQ ID NO 1). 

Sense and anti-sense digoxigenin-UTP labeled cRNA probes were generated from the T7 promoter 
using the DIG RNA Labeling Kit following the manufacturer's instructions. Serial cryosections rat 
jejunum were hybridized with the sense and antiisense probes, Digoxigenin labeling was detected 
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with the DIG Nucleic Acid Detection Kit based on previous methods. A positive signal is 
characterized by the deposition of a red reaction product at the site of hybridization. 

The anti-sense probe showed strong staining of epithelium along the crypt-villus axis under 
low magnification (40X). The observed rat NPCILI mRNA expression levels may have been 
somewhat greater in the crypts than in the villus tips. Under high magnification (200X), staining 
was observed in the enterocytes but not in the goblet cells. A lack of staining observed with the 
sense probe (control) confirmed tlie high specificity of the NPCILI anti-sense signal. These data 
provided further evidence of the role of rat NPCILI in intestinal cholesterol absorption. 

Example 10 : FACS Analysis of Fluorescently Labeled Ezetimibe Binding to Transiently 

Transfected CHO Cells. 

In these experiments, the ability of BODlPY-labeled ezetimibe (Altmann, et al, (2002) 
Biochim. Biophys. Acta 1580(l):77-93) to bind to NPCILI and SR-Bl was evaluated. "BODIPY" 
is a fluorescent group which was used to detect the BODlPY-ezetimibe. Chinese hamster ovary 
(CHO) cells were transiently transfected with rat NPCILI DNA (rNPClLl/CHO), mouse NPCILI 
DNA (mNPClLl/CHO), mouse SR-Bl DNA (mSRBI/CHO) or EGFP DNA (EGFP/CHO). EGFP 
is enhanced green fluorescent protein which was used as a positive control. The transfected CHO 
cells or untransfected CHO cells were then stained with 100 nM BODIPY-labeled ezetimibe and 
analyzed by FACS. Control experiments were also performed wherein the cells were not labeled 
with the BODlPY-ezetimibe and wherein untransfected CHO cells were labeled with the BODIPY- 
ezetimibe. 

No staining was observed in the untransfected CHO, rNPClLl/CHO or mNPClLl/CHO 
cells. Fluorescence was detected in the positive-control EGFP/CHO cells. Staining was also 
detected in the mouse SR-Bl/CHO cells. These data show that, under the conditions tested, 
BODIPY-ezetimibe is capable of binding to SR-Bl and that such binding is not ablated by the 
presence of the fluorescent BODIPY group. When more optimal conditions are determined, 
BODIPY-ezetimibe will be shown to label the rNPClLl/CHO and mNPClLl/CHO cells. 

Example 11 : FACS Analysis of Transiently Transfected CHO Cells Labeled with Anti- 

FLAG Antibody M2. 

In these experiments, the expression of FLAG-tagged NPCILI on CHO cells was 
evaluated. CHO cells were transiently transfected witihi mouse NPCILI DNA, rat NPCILI DNA, 
FLAG- rat NPCILI DNA or FLAG- mouse NPCILI DNA. The 8 amino acid FLAG tag used was 
DYKDDDDK (SEQ ID NO: 37) which was inserted on the amino-terminal extracellular loop just 
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past the secretion signal sequence. The cells were incubated with commercially available anti- 
FLAG monoclonal mouse antibody M2 followed by a BODDPY-tagged anti-mouse secondary 
antibody. The treated cells were then analyzed by FACS. 

The M2 antibody stained the CHO cells transfected with FLAG-rat NPCILI DNA and with 
5 FLAG-mouse NPCILI. No staining was observed in the CHO cells transfected with mouse 
NPCILI DNA and with rat NPCILI DNA. These data showed that rat NPCILI and mouse 
NPCILI possess no significant, inherent fluorescence and are not bound by the anti-FLAG 
antibody. The observed, FLAG-dependent labeling of the cells indicated that the FLAG-mouse 
NPCILI and FLAG-rat NPCILI proteins are localized at the cell membrane of the CHO cells. 

10 

Example 12 : FACS Analysis of FLAG-rat NPCILI-EGFP Chimera in Transiently 
Transfected CHO Cells. 

In these experiments, the surface and cytoplasmic localization of rat NPCILI in CHO cells 
was evaluated. CHO cells were transiently transfected with FLAG- rat NPCILI DNA or with 

15 FLAG-rat NPCILI-EGFP DNA. In these fusions, the FLAG tag is at amino-terminus of rat 

NPCILI and EGFP fusion is at the carboxy-terminus of rat NPCILI. The cells were then stained 
with the M2 anti-FLAG mouse (primary) antibody followed by secondary staining with a 
BODIPY-labeled anti-mouse antibody. In control experiments, cells were stained with only the 
secondary antibody and not with tlie primary antibody (M2). The stained cells were then analyzed 

20 by FACS. 

In a control experiment, ELAG-rat NPCILI transfected cells were stained with BODIPY 
anti-mouse secondary antibody but not with the primary antibody. The data demonstrated that the 
secondary, anti-mouse antibody possessed no significant specificity for FLAG-rat NPCILI and 
that the ELAG-rat NPCILI, itself, possesses no significant fluorescence. 
25 In another control experiment, unlabeled FLAG-rat NPCILI-EGFP cells were FACS 

analyzed. In these experiments, autofluorescence of the enhanced green fluorescent protein 
(EGFP) was detected. 

ELAG-rat NPCILI cells were stained with anti-FLAG mouse antibody M2 and with the 
BODIPY-labeled anti-mouse secondary antibody and FACS analyzed. The data from this analysis 
30 showed that the cells were labeled with the secondary, BODIPY-labeled antibody which indicated 
expression of the FLAG-rat NPCILI protein on the surface of the CHO cells. 

ELAG-rat NPCILI-EGFP cells were stained with anti-FLAG mouse antibody M2 and with 
the BODIPY-labeled anti-mouse secondary antibody and FACS analyzed. The data from this 
analysis showed that both markers (BODIPY and EGFP) were present indicating surface 
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expression of the chimeric protein. The data also indicated that a portion of the protein was located 
within the cells and may be associated with transport vesicles. These data supported a role for rat 
NPCILI in vesicular transport of cholesterol or protein expressed in subcellular organelles such as 
the rough endoplasmic reticulum. 

5 

Example 13 ; FACS Analysis and Fluorescent Microscopy of FLAG-rat NPCILI-EGFP 
Chimera in a Cloned CHO Cell Line. 

In these experiments, the cellular localization of rat NPCILI was evaluated by FACS 
analysis and by immunohistochemistry. CHO cells were transfected with FLAG-rat NPCILI- 
10 EGFP DNA and stained with anti-FLAG mouse antibody M2 and then with a BODIPY-labeled 
anti-mouse secondary antibody. In the fusion, the FLAG tag is at the amino-terininus of rat 
NPCILI and the enhanced green fluorescent protein (EGFP) tag is located at the carboxy-terminus 
of the rat NPCILL The stained cells were then analyzed by FACS and by fluorescence 
microscopy. 

15 Cells transfected with FLAG-rat M^'CiLi -EGFP DNA were stained with the anti-FLAG 

mouse antibody M2 and then with the BODIPY-labeled anti-mouse secondary antibody. FACS 
analysis of the cells detected both markers indicating surface expression of the chimeric protein. 

FLAG-rat NPCILI -EGFP transfected cells were analyzed by fluorescent microscopy at 
63X magnification. Fluorescent microscopic analysis of the cells indicated non-nuclear staining 

20 with significant perinuclear organelle staining. Resolution of the image could not confirm the 
presence of vesicular associated protein. These data indicated that the fusion protein was 
expressed on the cell membrane of CHO cells. 

Example 14 : Generation of Polyclonal Anti-rat NPCILI Rabbit Antibodies. 

25 Synthetic peptides (SEQ ID NO: 39-42) containing an amino- or carboxy-terminal cysteine 

residue were coupled to keyhole limpet hemocyanin (KLH) carrier protein through a disulfide 
linkage and used as antigen to raise polyclonal antiserum in New Zealand white rabbits (range 3-9 
months in age). The KLH-peptide was emulsified by mixing with an equal volume of Freund's 
Adjuvant, and injected into three subcutaneous dorsal sites. Prior to the 16 week immunization 

30 schedule a pre-immune sera sample was collected which was followed by a primary injection of 
0.25 mg KLH-peptide and 3 scheduled booster injections of 0.1 mg KLH-peptide. Animals were 
bled from the auricular artery and the blood was allowed to clot and the serum was then collected 
by centrifugation 
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The anti-peptide antibody titer was determined with an enzyme linked immunosorbent 
assay (ELISA) with free peptide bound in solid phase (l|ag/well). Results are expressed as the 
reciprocal of the serum dilution that resulted in an OD450 of 0,2, Detection was obtained using the 
biotinylated anti-rabbit IgG, horse radish peroxidase-streptavidin (HRP-SA) conjugate, and ABTS. 

5 

Example 15 ; FACS Analysis of Rat NPCILI Expression in CHO Cells Transiently 
Transfected with Rat NPCILI DNA Using Rabbit Anti-rat NPCILI Antisera. 

In these experiments, the expression of rat NPCILI on the surface of CHO cells was 
evaluated. CHO cells were transfected with rat NPCILI DNA, then incubated with either rabbit 

10 preimmune serum or with 10 week anti-rat NPCILI serum described, above, in Example 14 

A0715, A0716, A0867 or A0868), Cells labeled with primary antisera were then stained with a 
BODIPY-modified anti-rabbit secondary antibody followed by FACS analysis. 

No antibody surface labeling was observed for any of the pre-immune sera samples. 
Specific cell surface labeling of rat NPCILI transfected cells was observed for both A0715 and 

15 A0868. Antisera A0716 and A0867 did not recognize rat NPCILI surface expression in this assay 
format. This indicates that the native, unfused rat NPCILI protein is expressed in the CHO cells 
and localized to the CHO cell membranes. Cell surface expression of NPCILI is consistent with a 
role in intestinal cholesterol absorption. 

20 Example 16 : FACS Analysis of CHO Cells Transiently Transfected with FLAG-Mouse 

NPCILI DNA or FLAG-rat A^PCiLl DNA or Untransfected CHO Cells Using Rabbit Anti- 
rat NPCILI Antisera. 

In these experiments, the expression of FLAG-mouse NPCILI and FLAG-rat NPCILI in 
CHO cells was evaluated. CHO cells were transiently transfected with FL AG-mouse NPCILI 

25 DNA or with FLAG-rat NPCILI DNA. The FLAG-mouse NPCILI and FLAG-rat NPCILI 

transfected cells were labeled with either A0801, A0802, A0715 or A0868 sera (see Example 14) 
or with anti-FLAG antibody, M2. The labeled cells were then stained with BODIPY-labeled anti- 
rabbit secondary antibody and FACS analyzed. The untransfected CHO cells were analyzed in the 
same manner as the transfected cell lines. 

30 Positive staining of the untransfected CHO cells was not observed for any of the antisera 

tested. Serum AO 801 -dependent labeling of FLAG-rat NPCILI transfected cells was observed but 
such labeling of FLAG-mouse NPCILI transfected cells was not observed. Serum A0802- 
dependent labeling of FLAG-mouse NPCILI or FLAG-rat NPCILI transfected cells was not 
observed. Strong serum A0715-dependent labeling of FLAG-rat //PCi Li transfected cells was 

35 observed and weak serum A0715-dependent labeling of FLAG-mouse NPCILI transfected cells 
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was observed. Weak serum A0868 -dependent labeling of rat NPCILI and mouse NPCILI 
transfected cells was observed. Strong Anti-FLAG M2 antibody-dependent labeling of FLAG-rat 
NPCILI and FLAG-mouse NPCILI transfected cells was observed. The strong M2 staining is 
likely to be due to the fact that M2 is an affinity-purified, monoclonal antibody of known 
5 concentration. In contrast, the respective antisera are polyclonal, unpurified and contain an 

uncertain concentration of anti-rat NPCILI antibody. These date provide further evidence that the 
FLAG-mouse NPCILI and FLAG-rat NPCILI proteins are expressed in CHO cells and localized 
to the CHO cell membranes. Cell surface expression of NPCILI is consistent with a role in 
intestinal cholesterol absorption. 

10 

Example 17 : Immunohistochemical Analysis of Rat Jejunum Tissue with Rabbit Anti-rat 
NPCILI Antisera A0715. 

In these experiments, the localization of rat NPCILI in rat jejunum was analyzed by 
immunohistochemistry. Rat jejunum was removed, immediately embedded in O.C.T, compound 

1 5 and frozen in liquid nitrogen. Sections (6|J,m) were cut with a cryostat microtome and mounted on 
glass slides. Sections were air dried at room temperature and then fixed in Bouin's fixative. 
Streptavidin-biotin-peroxidase immunostaining was carried out using Histostain-SP kit. 
Endogenous tissue peroxidase activity was blocked with a 10 minute incubation in 3% H2O2 in 
methanol, and nonspecific antibody binding was minimized by a 45 minute incubation in 10% 

20 nonimmune rabbit serum. Sections were incubated with a rabbit anti-rat NPCILI antisera A0715 
or A0868 at a 1:500 dilution at 4®C, followed by incubation with biotinylated goat anti-rabbit IgG 
and with streptavidin-peroxidase. Subsequently, the sections were developed in an aminoethyl 
carbazole (AEC)-H202 staining system and counterstained with hematoxylin and examined by 
microscopy. A positive reaction using this protocol is characterized by the deposition of a red 

25 reaction product at the site of the antigen-antibody reaction. Nuclei appeared blue from the 

hematoxylin counterstain. Controls were performed simultaneously on the neighboring sections 
from the same tissue block. Control procedures consisted of the following: (1) substitute the 
primary antibody with the pre-immune serum, (2) substitute the primary antibody with the non- 
immune rabbit serum, (3) substitute the primary antibody with PBS, (4) substitute the second 

30 antibody with PBS. 

The example shows tissue stained with anti-rat NPCILI sera A0715 or with the preimmune 
sera analyzed at low magnification (40X) and at high magnification (200X). The A0715-stained 
tissue, at low magnification, showed positive, strong staining of the villi epithelial layer 
(enter ocytes). The A0715-stained tissue at high magnification showed positive, strong staining of 
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the enterocyte apical membranes. No staining was observed in tissue treated only with preimmune 
sera. Similar results were obtained with sera A0868. These data indicate that rat NPCILI is 
expressed in rat jejunum which is consistent with a role in intestinal cholesterol absorption. 

Example 18 : Labeled Cholesterol Uptake Assay. 

In this example, the ability of CHO cells stably transfected with rat NPCILI or mouse SR- 
Bl to take up labeled cholesterol was evaluated. In these assays, cholesterol uptake, at a single 
concentration, was evaluated in a pulse-chase experiment. The data generated in these experiments 
are set forth, below, in Table 3. 

Cells: 

A. CHO cells stably transfected with rat NPCILI cDNA 

B. CHO background (no transfection) 

Cells were seeded at 500,000 cells/ well (mL) in 12-well plates. 

Procedure: 

All reagents and culture plates were maintained at 37°C unless otherwise noted. 
Starve. The maintenance media (F12 HAMS, l%Pen/Strep, 10%FCS) was removed and 
the cells were rinsed with serum-free HAMS media. The serum-free media was then replaced with 
1 mL "starve" media (F12 HAMS, Pen/Strep, 5% lipoprotein deficient serum (TLPDS). 

One plate of each cell line was starved overnight. The remaining 2 plates were designated 
"No Starve" (see below). 

Pre-Incubation. Media was removed from all plates, rinsed with serum-free HAMS and 
replaced with starve media for 30 minutes, 

^H'Cholesterol Pulse. The following was added directly to each well. 

0.5 /xCi ^H-cholesterol (-1.1 X 10^ dpm/well) in 50/xl of a mixed bile salt micelle. 
4.8mM sodium taurocholate (2.581mg/mL) 
0.6 mM sodium oleate (0.183mg/mL) 
0.25 mM cholesterol (0.1 mg/mL) 
Dispersed in "starve" media by ultrasonic vibration 
Final media cholesterol concentration = S/xg/mL 
Labeled cholesterol pulse time points were 0, 4, 12 and 24 minutes. Triplicate wells for 
each treatment were prepared. 
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Wash. At the designated times, media was aspirated and the cells were washed once with 
Hobbs Buffer A (50mM Tris, 0.9% NaCl, 0.2% BSA, pH 7.4) and once with Hobbs Buffer B 
(50mM Tris, 0.9% NaCl, pH 7.4 (no BSA)) at 

Processing/Analysis. Cells were digested overnight with 0.2N NaOH, 2mLywell at room 
5 temperature. One 1 ,5 mL aliquot was removed from each well, neutralized & counted for 

radioactivity by scintillation counting. Two additional 50jixl aliquots from all wells are assayed for 
total protein by the Pierce micro BC A method. The quantity of labeled cholesterol observed in the 
cells was normalized by the quantity of protein in the cells. 

10 Table 3. Uptake of ^H-cholesterol by CHO cells transfected with rat NPCILI or mouse SR- 
Bl or untransfected CHO cells. 





Total Cholesterol, dpin protdn ± sem 






Total Cholesterol, di 


)ni/mg proteLQ ± sem 


Time, imn 


NPCILI 


CHO 






NPCILI 1 


CHO 




Afters-Cholesterol 








No Starve 






22881 


±9230 


0 


2067 


±46 


4568 


±1937 






10754 


±166 


4 


2619 


±130 


2868 


±193 






15366 


±938 


15636 


±1471 


12 


2868 


±193 


4459 


±170 






15636 


±1471 


24622 


±966 


24 


7010 


±89 


7204 


±173 






41129 


±685 


39361 


±1207 










Si 


ar 


ve 










0 


1937 


±273 


2440 


±299 






10909 


±1847 


12429 


±1673 


4 


3023 


±308 


2759 


±105 






17278 


±1650 


14307 


±781 


12 


2759 


±105 


4857 


±186 






14307 


±781 


26270 


±1473 


24 


6966 


±72 


7344 


±65 






39196 


±174 


38381 


±161 



dpm=disintegrations per minute 
sem=standard error of the mean 

15 

Example 19 : Effect of Ezetimibe on Cholesterol Uptake. 

The effect of ezetimibe on the ability of CHO cells stably transfected with mouse or rat 
NPCILI or mouse SR-Bl to take up ^H-labeled cholesterol was evaluated in pulse-chase 
experiments. One cDNA clone of mouse NPCILI (C7) and three clones of rat NPCILI (C7, C17 
20 and C21) were evaluated. The ability of CHO cells stably transfected with mouse SR-Bl, mouse 
NPCILI and rat NPCILI to take up labeled cholesterol, in the absence of ezetimibe, was also 
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evaluated in the pulse-chase experiments. Data generated in these experiments are set forth, below, 
in Tables 4 and 5, Additionally, the quantity of total cholesterol taken up by transfected and 
untransfected CHO cells in the presence of four different unlabeled cholesterol concentrations was 
also evaluated. The data from these experiments is set forth, below, in Table 6. 

CeUs: 

A. CHO cells stably transfected with rat or mouse NPCILI cDNA 

B. CHO background (no transfection) 

C. SR-Bl transfected CHO cells 

Cells seeded at 500,000 cells / well (mL) in 12-well plates. 

Procedure: 

All reagents and culture plates were maintained at 37**C unless otherwise noted. 
Starve. The maintenance media (F12 HAMS, l%Pen/Strep, 10%FCS) was removed and 
the cells were rinsed with serum-free HAMS media. The serum-free media was then replaced with 
1 mL "starve" media (F12 HAMS, Pen/Strep, 5% lipoprotein deficient serum (LPDS). The cells 

were then starved overnight. 

Pre-Incubation/ pre-dose. Media was removed from all plates and replaced with fresh 
starve media and preincubated for 30 minutes. Half of the wells received media containing 
ezetimibe (stock soln in EtOH; final cone. = lOjtxM), 

^H-ChoUsterol Pulse. The following was added directly to each well: 

O.SjtxCi ^H-cholesterol (-1.1 X 10^ dpm/well) in SOji^l of a mixed bile salt micelle 
4.8mM sodium taurocholate (2.581mg/mL) 
0,6 mM sodium oleate (0.183mg/mL) 
0.25 mM cholesterol (0.1 mg/mL) 
Dispersed in "starve" media by ultrasonic vibration 
Final media cholesterol concentration = 5jLtg/mL 
Labeled cholesterol pulse time points were 4, 12, 24 minutes and 4 hours. Triplicate wells 
were prepared for each treatment. 

Wash. At designated times, media was aspkated and cells were washed once with Hobbs 
Buffer A (50mM Tris, 0.9% NaCl, 0.2% bovine serum albumin (BSA), pH 7.4) and once with 
Hobbs Buffer B (50mM Tris, 0.9% NaCl, pH 7.4 (no BSA)) at 37°C. 
Processing/Analysis. 
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A, 4, 12, 24 minute time points : Cells were digested overnight with 0.2N NaOH, 2mIVwell, room 
temperature. One 1.5 mL aliquot was removed from each well, neutralized & counted for 
radioactivity by scintillation counting. 

B, 4 hour time point : The digested cells were analyzed by thin-layer chromatography to determine 
5 the content of cholesterol ester in the cells. 

Extracts were spotted onto TLC plates and run for 30 minutes in 2 ml hexane:isopropanol 
(3:2) mobile phase for 30 nainutes, followed by a second run in 1ml hexane:isopropanol (3:2) 
mobile phase for 15 minutes. 

C, Protein determination of cell extracts . Plates containing a sample of the cell extracts were 

10 placed on orbital shaker at 120 rpm for indicated times and then extracts are pooled into 12 X 75 
tubes. Plates were dried and NaOH (2ml/well) added. The protein content of the samples were 
then determined. Two additional 50jLtl aliquots from all wells were assayed for total protein by the 
Pierce micro BCA method. The quantity of labeled cholesterol observed in the cells was 
normalized to the quantity of protein in the cells. 

15 

Table 4. Total Cholesterol in Transfected CHO Cells in the Presence and Absence of 
Ezetimibe. 



wo 2004/009772 



PCT/US2003/022467 



-42- 





Total Cholesterol, dpm ± sem 




Total Cholesi^ol, dprn/mg protein 


±seni 




Vehicle 


EZaOwM) 




Vdilde 


EZdOftM) 


Clones: 








4Min Pulse 








CHO Control 








d26 




33443 


+4070 


31881 


±483 


SR-BI 


14207 


±51 


10968 


±821 




118242 


±1261 


92474 


±2902 


mNPCULKCT) 


4043 


±419 


4569 


±222 




30169 


±3242 


30916 


±1137 


rNPClLl(C21) 


3283 


:i288 


3769 


±147 




23728 


di2111 


27098 


±689 


rNPClLKClT) 


3188 


i232 


3676 


±134 




24000 


±832 


28675 


±527 


rNPCULKCT) 


1825 


dS06 


3268 


±121 




15069 


±6794 


27285 


±968 










12 IVGn Pulse 








CHO Control 


4710 


df246 


4532 


±165 




44208 


±2702 


43391 


±1197 


SR-BI 


16970 


±763 


12349 


±298 




140105 


±6523 


98956 


±4447 


mNPCULKCT) 


6316 


±85 


6120 


±755 




45133 


±342 


41712 


±4054 


rNPClLKCa.) 


5340 


±12 


4703 


±231 




40018 


±1181 


33985 


±1928 


rNPClUL(C17) 


4831 


±431 


4579 


i257 




37378 


±3461 


34063 


±1619 


rNPClLKCT) 


4726 


d£272 


4664 


±63 




39100 


i2350 


38581 


±784 












24 IVGn Pulse 








CHO Control 


7367 


i232 


6678 


dL215 




65843 


±1281 


61764 


d2131 


SRtBI 


39166 


±2152 


23558 


±1310 




324126 


±11848 


198725 


±11713 


mNPClLKCT) 


10616 


±121 


9749 


±482 




77222 


±1040 


74041 


±3670 


rNPClLl(C21) 


9940 


±587 


8760 


d293 




76356 


±9618 


66165 


df2181 


rNPCULKClT] 


I 8728 


±721 


8192 


i237 




70509 


±5189 


62279 


±4352 


rNPC10(C7] 


f 8537 


±148 


7829 


df204 




72134 


±1305 


63482 


±368 



EZ = ezetimibe 
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Table 5. Cholesterol Ester in CHO cells in the Presence or Absence of Ezetimibe. 





Oinlestervl Ester, dpm ± sem 




Cholesteryl £ster, dpnymg protein 


±sem 


Vehicle 




Vdiide 


£Z(10|iM) 


Qones: 








4HourPiilse 








CHO Control 


652 


±13 


208 






5647 


±55 


1902 


d^7 


SR-BI 


47608 


±1292 


9305 


±401 




391067 


±14391 


72782 


43181 


inNPCU^KCT) 


732 


±127 


453 


±118 




4994 


±827 


3057 


±776 


rNPClLl(C21) 


2667 


±90 


454 


±33 




18655 


±1<B2 


3193 


±265 


rNPCU.l(C17) 


751 


±74 


202 


±10 




5379 


±481 


1510 


±62 


rNPClLKCT) 


462 


£25 


191 


±54 




3597 


±193 


1496 


±403 





Ftee Cliolesterol, dim ± son 




Free Cholesterol, dpm/m^ proton 


±seni 


Vehide 


EZ(10mM) 


Vetude 


EZ(lOAtM) 










4 Hour Pulse 








CHO Control 


61612 


±1227 


56792 


±568 




533876 


±17770 


519607 


±16203 


SR-BI 


214678 


±4241 


194519 


±474 




1762873 


±46607 


1521341 


±4185 


BoNPClLKCT) 


79628 


±793 


77516 


±1910 




544661 


±1269 


523803 


±10386 


rNPClLl(C21) 


71352 


±1343 


69106 


±711 




498016 


±8171 


485460 


±4410 


rNPCULKCl?) 


78956 


±3782 


71646 


±446 




566456 


±29204 


536651 


±7146 


rNPCULKCT) 


75348 


±2093 


70628 


±212 




586127 


±13932 


556855 


±7481 



EZ =ezetimibe 



10 



15 
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Table 6. Uptake of labeled cholesterol in the presence of increasing amounts of unlabeled 
cholesteroL 





Total Cholesterol, dom ± sem 




Total Cholesterol, dpm/me protein ± sem 




CHO Control 1 


SR-BI 


m1MPClLUC7ll rNPClLlfC21) 




CHO Control 1 


SR-BI 


1 mNPClLlfC7) 1 rNPC:iLirC21> 










24 Min Pulse 




140764 ±14433 


94945 ±12916 




12271 ±430 


49603 ±2428 


14250 ±1628 


10656 ±1233 




108936 ±5413 


541562 ±13783 


10 ftgTtnL 


16282 ±2438 


79967 ±8131 


25465 ±3037 


13225 ±4336 




151283 ±23343 


880224 ±82234 


250985 :i27481 


123433 ±34092 


30 ^g/mL 


14758 ±1607 


71925 ±3863 


19001 ±1530 


13218 ±1149 




135109 ±12106 


796236 ±18952 


180436 ±12112 


111522 ±6941 


100 nefml. 


16458 ±1614 


58185 ±4348 


15973 ±1663 


1 1560 ±1132 




149559 ±17977 


630143 ±3718 


147717 ±8261 


101328 ±7191 






Cholestervl Ester, dom ± sem 






Cholestervl Ester, dpm/me protein ± sem 




P 

CHO Control 1 


SR-BI 


lmNPClLKC7ll rNPClLlCC21) 




CHO Control 1 


SR-BI 


1 mNPClLKC?) 1 


rNPClLl(C21) 










4 Hour Pulse 








3 ftg/mL 


2737 ±114 


39596 ±1241 


1561 ±1 


4015 ±47 




22050 ±978 


382641 ±3933 


13684 ±217 


32020 ±641 


10 ftg/tnL 


1646 ±76 


17292 ±362 


998 £i6 


1866 ±33 




13323 ±606 


157914 ±3400 


8917 ±467 


14849 ±127 


30 ngfmt. 


970 ±46 


6642 ±133 


537 ±82 


970 ±9 




7627 ±323 


63S47 ±1760 


4885 ±748 


7741 ±100 


100 ftg/mL 


895 ±136 


4777 ±27 


405 ±7 


777 ±16 




7135 ±1230 


45088 ±1326 


3663 ±68 


6005 ±198 






Free Cholesterol, dpm ±sem 






Free Cholesterol, dpm/mg protein ± sem 




r 

CHO Control 1 


SR-BI 


lmNPClLWC7>l rNPClLl(C21) 




CHO Control 1 


SR-BI 


1 mNPClLl(C7) 1 


rNPClLlfC21) 










4 Hour Pulse 








3 fig/tnL 


89013 ±3724 


211783 ±3268 


104343 ^112 


92244 ±987 




717308 :i34]30 


2047695 ±16213 


914107 ±3869 


735498 ±11209 


10 ftg/mL 


136396 ±8366 


278216 ±10901 


196173 ±4721 


125144 ±877 




1105118 ±76074 


2540130 ±92471 


1753072 ±86378 


996824 ±27830 


30 ftg/tnL 


131745 ±2922 


224429 ±2336 


149172 ±19689 


117143 ±4976 




1036195 ^21142 


2149315 ±78068 


1357136 ±180264 


934772 ±43202 


100 fig/mL, 


79336 ±4011 


231470 ±4221 


114599 d2803 


93538 ±1388 




632965 ^29736 


2182022 ±36793 


1035979 ;i^0329 


723225 ±21694 






Cholestervl Ester, dpm ± sem 






Cholesteryl Ester, 


dpm/me protein ± sem 




CHO Control 1 


SR-BI 


ImNPClLUC?)! rNPClLl(C21) 




CHO Control 1 


SR-BI 


1 mNPClLl(C7) 1 rNPClLUC21) 










24 Hour Pulse 








3 itg/mL 


57373 ±2704 


162296 ±1644 


22986 ±940 


59377 ±933 




357629 ±14639 


1248900 ±18363 


160328 ±6363 


401315 ±3337 


10 (ig/mL 


33730 ±1296 


112815 ±373 


14836 ±332 


31797 ±523 




215004 ±5942 


830231 ±12764 


98594 ±4203 


200451 ±3239 


30 ftgfmL 


19193 ±100 


58668 ±1413 


8878 ±333 


18963 ±380 




12207 1 ±1271 


446581 ±3472 


59091 ±2697 


119728 ±2131 


100 ftg/niL 


16761 ±398 


31280 ±1270 


8784 ±946 


14933 i^ll 




103235 ±1739 


272796 ±13392 


60670 ±4397 


96215 ±1023 






Free Cholesterol, dpm ± sem 






Free Cholesterol, dpm/mg protein ± sem 




CHO Control i 


SR-BI 


lmNPClLl(C7)l rNPClLUC21) 




CHO Control 1 


SR-BI 


1 mNPClLKC?) 1 


rNPClLl{C21) 










24 Hour Pulse 








3fts/mL 


248985 ±4207 


357819 ±4319 


285610 ±3187 


227244 ±1016 




1552637 ±18934 


2752957 ±24984 


1993256 ±36968 


1536023 ±10304 


10 ftg/mL 


23 1208 ±8927 


269822 ±3872 


311777 ±8227 


231666 ±6198 




1477414 ±83934 


1984473 ±18420 


2069980 ±23317 


1461157 ±38317 


30 /tE/niL 


203566 ±6008 


225273 ±3932 


279604 ±6612 


209372 ±3386 




1294878 ±41819 


1716066 ±32381 


1859476 ±29307 


1321730 ±3432 


100 ng/mL 


178424 ±2379 


167082 d221I 


229832 ±4199 


182678 ±7709 




1099648 ±23160 


1455799 ±9883 


1599244 ±76938 


1177546 ±31191 



Example 20 : Labeled Cholesterol Uptake Assay. 

In this example, the ability of CHO cells transiently transfected with rat NPCILI or mouse 
SR-BI to take up labeled cholesterol was evaluated. Also evaluated was the ability of rat NPCILI 
to potentiate the ability of CHO cells transfected with mouse SR-BI to take up labeled cholesterol. 
In these assays, cholesterol uptake, at a single concentration, was evaluated in pulse-chase 
experiments. The data generated in these experiments are set forth, below, in Table 7. 

CeUs: 

A. CHO background cells (mock transfection), 

B. CHO cells transiently transfected with mouse SR~BL 

C. CHO transiently transfected with rst NPCILI cDNAs (n=8 clones). 
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Transiently transfected cells were seeded at 300,000 cells / well (niL) in 12-well plates. 
Procedure: 

All reagents and culture plates were maintained at 37°C unless odierwise noted. 

5 

Starve. The maintenance media (F12 HAMS, l%Pen/Strep, 10%FCS) was removed from 
the cells and replaced with 1 mL "starve" media (F12 HAMS, Pen/Strep, 5% lipoprotein deficient 
serum (LPDS), Cells were starved for 1 hour. 

^H'Cholesterol Pulse. The following was added direcdy to each well. 
10 O.SjtiCi ^H-cholesterol (~ LI X 10^ dpm/well) in SO/il of a mixed bile salt micelle. 

4.8mM sodium taurocholate (2,581mg/mL) 
0.6 ntiM sodium oleate (0,183mg/mL) 
0.25 mM cholesterol (0.1 mg/mL) 
15 Dispersed in "starve" media by ultrasonic vibration 

Final media cholesterol concentration = 5/xg/mL 
Labeled cholesterol pulse time points were 24 Min and 4 hours. Triplicate wells for each 
treatment. 

Wash, At the designated times, media was aspirated and cells were washed once with 
20 Hobbs Buffer A (50mM Tris, 0.9% NaCl, 0.2% BSA, pH 7.4) and once with Hobbs Buffer B 
(50mM Tris, 0.9% NaCl, pH 7.4 (no BSA)) at 37°C, 
Processing/Analysis. 

A. 24 minute time point : Cells were digested overnight with 0.2N NaOH, 2mL/well at room temp. 
One, 1.5 mL aliquot was removed from each well, neutralized & counted for radioactivity by 

25 scintillation counting. 

B. 4 hour time point : The digested cells were analyzed by thin-layer chromatography to determine 
the content of cholesterol ester in the cells. 

The extracts were spotted onto thin layer cliromatography plates and run in 2 ml 
hexane:isopropanol (3:2) containing mobile phase for 30 minutes, followed by a second run in 1ml 
30 hexane:isopropanol (3:2) containing mobile phase for 15min. 

C. Protein detemiination of cell extracts : Plates containing a sample of the cell extracts were 
placed on orbital shaker at 120 rpm for indicated times and then extracts are pooled into 12X75 
tubes. Plates were dried and NaOH (2ml/well) added. The protein content of the samples were 
then determined. Two additional SOfil aliquots from all wells were assayed for total protein by the 
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Pierce micro BCA method. The quantity of labeled cholesterol observed in the cells was 
normalized to the quantity of protein in the cells. 



5 Table 7, Labeled cholesterol uptake in transiently transfected CHO cells. 





Total Cholesterol, ± sem 




dpm 




dpm/mg protem 


Transfection 


24 Min Pulse 


CHO Control (mock) 


4721 ±436 




49024 ±4328 


SR-BICIVansient) 


5842 ±82 




59445 ±1099 


NPCILI (Transient) 


4092 ±377 




47026 ±2658 


SR-Bl/NPCILI (trans) 


3833 ±158 




52132 ±3071 




Cholesteryl Ester, ± sem 




dpm 




dpm/mg protein 




4 Hour Pulse 


CHO Control (mock) 


2132 ±40 




20497 ±640 


SR-BI(Transient) 


5918 ±237 




51812 ±1417 


NPCILI CTransient) 


1944 ±93 




19788 ±642 


SR-Bl/NPCILI (trans) 


4747 ±39 




58603 ±1156 




Free Cholesterol, ± sem 




dpm 




dpnVmg proteui 




4 Hour Pulse 


CHO Control (mock) 


45729 =t328 




439346 ±5389 


SR-BICItansient) 


50820 ±f2369 




444551 ±9785 


NPCILI (Transient) 


39913 ±1211 




406615 ±6820 


SR-BI/NPCILI (trans) 


37269 ±1225 




459509 ±6195 



Example 21 : Expression of rat, mouse and human NPCILI. 

In this example, NPCILI was introduced into cells and expressed. Species specific 
10 NPCILI expression constructs were cloned into the plasxnid pCDNA3 using clone specific PCR 
primers to generate the ORF flanked by appropriate restriction sites compatible with the polylinker 
of the vector. For all three species of NPCILI, small intestine total tissue RNA was used as a 
template for reverse transcriptase-polymerase chain reaction (RT-PCR) using oligo dT as the 
template primer. The rat NPCILI was cloned as an EcoRI fragment, human NPCILI was cloned 



wo 2004/009772 



PCT/US2003/022467 



-47- 

as a Xbal/NotI fragment and mouse NPCILI was cloned as an EcoRI fragment. Forward and 
reverse strand sequencing of each clone was performed to confirm sequence integrity. Standard 
transient transfection procedures were used with CHO cells. In a 6- well plate CHO cells were 
plated 1 day before transfection at a plating density of 2 X 10^ cells/well. The following day, cells 

5 were incubated with 2 \xjg plasmid DNA and 6 \xL Lipofectamine for 5 hours followed a fresh 
media change. Forty-eight hours later, cells were analyzed for NPCILI expression using anti- 
NPCILI antisera by either FACS or westem blot. To establish stable long term cell lines 
expressing NPCILI, transfected CHO cells were selected in the presence of geneticin (G418, 0.8 
mg/ml) as recommended by the manufacturer (Life Technologies). Following one month of 

10 selection in culture, the cell population was stained with anti-NPClLl antisera and sorted by 
FACS. Individual positive staining cells were cloned after isolation by limiting dilution and then 
maintained in selective media containing geneticin (0.5 mg/ ml). 

Other cell types less susceptible to transfection procedures have been generated using 
adenoviral vector systems. This system used to express NPCILI is dervied from Ad 5, a type C 

15 adenovirus. This recombinant replication-defective adenoviral vector is made defective through 

modifications of the El, E2 and E4 regions . The vector also has additional modifications to the E3 
region generally affecting the E3b region genes RIDa and RBDb. NPCILI expression was driven 
using the CMV promoter as an expression cassette substituted in the E3 region of the adenovirus. 
Rat and mouse NPCILI were amplified using clone specific primers flanked by restriction sites 

20 compatible with the adenovirus vector Adenovirus infective particles were produced from 293- 
D22 cells in titers of 5 X 10^^ P/mL. Viral lysates were used to infect cells resistant to standard 
transfection methodologies. In Caco2 cells, which are highly resistant to heterologous protein 
expression, adenovirus mediated expression of NPCILI has been shown by westem blot analysis 
to persist at least 21 days post-infection. 

25 

Example 22 : NPCILI Knock-Out Transgenic Mouse. 

NPCILI knockout mice were constructed via targeted mutagenesis. This methodology 
utilized a targeting constmct designed to delete a specific region of the mouse NPCILI gene. 
During the targeting process the E. coli lacZ reporter gene was inserted under the control of the 
30 endogenous NPCILI promoter. The region in NPCILI (SEQ ID NO: 45) being deleted is from 
nucleotide 790 to nucleotide 998. The targeting vector contains the LacZ-Neo cassette flanked by 
1.9 kb 5' arm ending with nucleotide 789 and a 3.2 kb 3' arm starting with nucleotide 999. 
Genomic DNA from the recombinant embryonic stem cell line was assayed for homologous 
recombination using PGR. Amplified DNA fragments were visualized by agarose gel 
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electrophoresis. The test PCRs employed a gene specific primer, which lies outside of and 
adjacent to the targeting vector arm, paired with one of three primers specific to the LacZ-Neo 
cassette sequence. For 5' PCR reconfirmation, the NPCILI specific oligonucleotide 
ATGTTAGGTGAGTCTGAACCTACCC (SEQ ID NO: 46) and for 3'PCR reconfirmation the 

5 NPCILI specific oligonucleotide GGATTGCATTTCCTTCAA GAAAGCC (SEQ ID NO: 47) 
were used. Genotyping of the F2 mice was performed by multiplex PCR using the NPCILI 
specific forward primer TATGGCTCTGCCC TCTGCAATGCTC (SEQ ID NO: 48) the LacZ-Neo 
cassette specific forward primer TCAGCAGCCTCTGTTCCAC ATACACTTC (SEQ ID NO: 49) 
in combination with the NPCILI gene specific reverse primer 

10 GTTCCACAGGGTCTGTGGTGAGTTC (SEQ ID NO: 50) allowed for determination of both the 
targeted and endogenous alleles. Analysis of the PCR products by agarose gel electrophoresis 
distinguished the wild-type, heterozygote and homozygote null mouse from each other. 

•1* •!« *i* •S' •t' *i* ^» •S' *1* •I' •!« •!« alf •!£ •!« •!£ (f* •!« 4t 4t %l£ 

«fi Jfi «^ ^ «^ ^fiSl'^P *P *P *P *P *P *P ^ •(•^•ji ^ ^ ^ ^ ^ ^ ^ 

The present invention is not to be limited in scope by the specific embodiments described 
15 herein. Indeed, various modifications of the invention in addition to those described herein will 

become apparent to those skilled in the art from the foregoing description. Such modifications are 
intended to fall within the scope of the appended claims. 

Patents, patent applications, publications, product descriptions, Genbank Accession 
Numbers and protocols are cited throughout this application, the disclosures of which are 
20 incorporated herein by reference in their entireties for all purposes. 
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We Claim: 

1. An isolated polypeptide comprising 42 or more contiguous amino acids from an amino acid 
sequence selected from SEQ ID NOs: 2 and 12. 

5 2, An isolated polypeptide comprising an amino acid sequence selected from SEQ ID NOs: 2 and 
12. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

10 4. An isolated polynucleotide comprising a nucleotide sequence selected from SEQ ID NOs: 1 and 
11. 

5. A recombinant vector comprising the polynucleotide of claim 3. 
15 6. A host cell comprising the vector of claim 5. 

7. An antibody which specifically binds to a polypeptide of claim 1. 

8. An antibody which specifically binds to a polypeptide comprising an amino acid sequence 
20 selected from SEQ ID NOs: 39-42. 

9. A method for making a polypeptide comprising culturing a host cell of claim 6 under conditions 
in which the nucleic acid is expressed. 

25 10. The method of claim 9 wherein the polypeptide is isolated from the culture. 

11. A method for identifying an antagonist of NPCILI comprising: 

(a) contacting a host cell expressing a polypeptide comprising an amino acid sequence 
selected from SEQ ID NOs: 2, 4 and 12 or a functional fragment thereof on a cell surface, in the 

30 presence of a known amount of detectably labeled ezetimibe, with a sample to be tested for the 

presence of the antagonist; and 

(b) measuring the amount of detectably labeled ezetimibe specifically bound to the 

polypeptide; 



wo 2004/009772 



PCT/US2003/022467 



-50- 

wherein an NPCILI antagonist in the sample is identified by measuring substantially 
reduced binding of the detectably labeled ezetimibe to the polypeptide, compared to what would be 
measured in the absence of such an antagonist. 

5 12. A method for identifying an antagonist of NPCILI comprising: 

(a) placing, in an aqueous suspension, a plurality of support particles, impregnated with a 
fluorescer, to which a host cell expressing a polypeptide comprising an amino acid sequence 
selected from SEQ ID NOs: 2, 4 and 12 or a functional fragment thereof on a cell surface are 
attached; 

10 (b) adding, to the suspension, radiolabeled ezetimibe and a sample to be tested for the 

presence of the antagonist, wherein the radiolabel emits radiation energy capable of activating the 
fluorescer upon the binding of the ezetimibe to the polypeptide to produce light energy, whereas 
radiolabeled ezetimibe that does not bind to the polypeptide is, generally, too far removed from the 
support particles to enable the radioactive energy to activate the fluorescer; and 

15 (c) measuring the light energy emitted by the fluorescer in the suspension; 

wherein an NPCILI antagonist in the sample is identified by measuring substantially 
reduced light energy emission, compared to what would be measured in the absence of such an 
antagonist. 

20 13. The method of claim 12 wherein the fluorescer is selected from yttrium silicate, yttrium oxide, 
diphenyloxazole and poly vinyltoluene. 

14. A method of claim 1 1 wherein the ezetimibe is labeled with a radiolabel selected from and 

125j 

25 

15. A method of claim 12 wherein the ezetimibe is labeled with a radiolabel selected from and 

16. A method for identifying an antagonist of NPCILI comprising: 

30 (a) contacting a host cell expressing a polypeptide comprising an amino acid sequence 

selected from SEQ ID NOs: 2, 4 and 12 or a functional fragment thereof on a cell surface with 
detectably labeled cholesterol and with a san^le to be tested for the presence of the antagonist; and 

(b) measuring the amount of detectably labeled cholesterol in the cell; 
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wherein an NPCILI antagonist in the sample is identified by measuring substantially reduced 
detectably labeled cholesterol within the host cell, compared to what would be measured in the 
absence of such an antagonist. 

17 . The method of claim 16 wherein the cholesterol is detectably labeled with a radiolabel 
selected from and ^^^L 

18. A method according to claim 1 1 wherein the host cell is selected from a Chinese hamster ovary 
(CHO) cell, a J774 cell, a macrophage cell and a Caco2 cell. 

19. A method according to claim 12 wherein the host cell is selected from a Chinese hamster ovary 
(CHO) cell, a J774 cell, a macrophage cell and a Caco2 cell. 

20. A method according to claim 16 wherein the host cell is selected from a Chinese hamster ovary 
(CHO) cell, a J774 cell, a macrophage cell and a Caco2 cell. 

21. A mutant mouse comprising a homozygous disruption of endogenous, chromosomal NPCILI 
wherein the mouse does not produce any functional NPCILI protein. 
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SEQUENCE LISTING 



<110> Severing Corporation 



:120> NPClIil (NPC3) AND METHODS OF USE THEREOF 



<130> JB01603-K-WI 



<160> 50 



<170> Patentin version 3 . 1 



<210> 1 

<211> 3996 

<212> DNA 

<213> Rattus sp 



<220> 

<221> CDS 

<222> (1)..(3996) 

<223> 



<400> 1 

atg gca get gcc tgg ctg gga tgg ctg etc tgg gcc ctg etc ctg age 48 
Met Ala Ala Ala Trp Leu Gly Trp Leu Leu Trp Ala Leu Leu Leu Ser 
15 10 15 

gcg gcc cag ggt gag eta tac aca cec aaa cac gaa get ggg gtc tgc 96 
Ala Ala Gin Gly Glu Leu Tyr Thr Pro Lys His Glu Ala Gly Val Cys 

20 25 30 

ace ttt tac gaa gag tgc ggg aaa aac eca gag etc tct gga ggc etc 144 
Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 40 45 

aeg tea eta tee aat gta tee tgc ctg tct aac ace ceg gee cge cae 192 
Thr Ser Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 
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gtc acg ggt gaa cac ctg get ctt etc cag cgc ate tgt ccc ege etg 240 

Val Thr Gly Gl\i His Leu Ala Leu Leu Gin Arg lie Cys Pro Arg Leu 
65 70 75 80 

tae aae gge eec aat aee aet ttt gee tgt tge tct aee aag eag etg 288 
Tyr Asn Gly Pro Asn Thr Thr Phe Ala Cys Cys Ser Thr Lys Gin Leu 

85 90 95 

etg tee tta gaa age age atg tee ate ace aag gee ett ete aeg ego 336 
Leu Ser Leu Glu Ser Ser Met Ser lie Thr Lys Ala Leu Leu Thr Arg 

100 105 110 

tgc ccg gcc tgc tct gac aat ttt gtg age tta cac tge cac aac act 384 
Cys Pro Ala Cys Ser Asp Asn Phe Val Ser Leu His Cys His Asn Thr 
115 120 125 

tgc age ect gac eag age etc ttc ate aac gtc ace egg gtg gtt gag 432 
Cys Ser Pro Asp Gin Ser Leu Phe lie Asn Val Thr Arg Val Val Glu 
130 135 140 

egg gge get gga gag cet cet gcc gtg gtg gee tat gag gee ttt tat 480 
Arg Gly Ala Gly Glu Pro Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

eag cgc age ttt get gag aag gee tat gag tee tge age eag gtg cgc 528 
Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Gin Val Arg 

165 170 175 

ate cet gcg gcc get tec ttg gcc gtg gge age atg tgt gga gtg tat 576 
lie Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 

180 185 190 

gge tee gee etc tge aat get eag ege tgg etc aac ttc caa gga gac 624 
Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

aca ggg aat gge etg get ccg etg gat ate aee ttc cac ete ttg gag 672 
Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 

cet gge cag gcc eta ccg gat ggg ate eag cea ctg aat ggg aag ate 720 
Pro Gly Gin Ala Leu Pro Asp Gly lie Gin Pro Leu Asn Gly Lys lie 
225 230 235 240 

gea ccc tgc aac gag tct cag ggt gat gac tea gea gtc tgc tec tgc 768 
Ala Pro Cys Asn Glu Ser Gin Gly Asp Asp Ser Ala Val Cys Ser Cys 

245 250 255 

eag gac tgt gcg gcg tee tge cet gtc ate cet ccg ccc gag gcc ttg 816 
Gin Asp Cys Ala Ala Ser Cys Pro Val lie Pro Pro Pro Glu Ala Leu 

260 265 270 

cgc cet tee ttc tae atg ggt ege atg cea gge tgg etg gee ete ate 864 
Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu He 
275 280 285 

ate ate ttc act get gtc ttt gtg ttg ete tct gea gtc ett gtg egt 912 
He He Phe Thr Ala Val Phe Val Leu Leu Ser Ala Val Leu Val Arg 
290 295 300 



etc ega gtg gtt tee aac agg aac aag aae aag gea gaa gge ccc eag 



960 
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Leu Arg Val Val Ser Asn Arg Asn Lys Asn Lys Ala Glu Gly Pro Gin 
305 310 315 320 

gaa gcc ccc aaa etc cct cat aag cac aaa etc tea ccc cat acc ate 1008 
Glu Ala Pro Lys Leu Pro His Lys His Lys Leu Ser Pro His Thr lie 

325 330 335 

ctg gge egg ttc ttc cag aac tgg ggc aca agg gtg gcc teg tgg cea 1056 
Leu Gly Arg Phe Phe Gin Asn Trp Gly Thr Arg Val Ala Ser Trp Pro 

340 345 350 

etc acc gtc tta gca ctg tec ttc ate gtt gtg ata gcc tta gea gea 1104 
Leu Thr Val Leu Ala Leu Ser Phe lie Val Val lie Ala Leu Ala Ala 
355 360 365 

ggc ctg ace ttt att gaa etc ace aca gae cct gtg gaa ctg tgg teg 1152 
Gly Leu Thr Phe lie Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 

gee eee aag age cag gee egg aaa gag aag tet ttc cat gat gag cat 1200 
Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ser Phe His Asp Glu His 

385 390 395 400 

ttc gge ccc ttc ttt cga acc aac cag att ttc gtg aca get egg aac 1248 
Phe Gly Pro Phe Phe Arg Thr Asn Gin He Phe Val Thr Ala Arg Asn 

405 410 415 

agg tec age tac aag tac gac tec eta ctg eta ggg tec aag aac ttc 1296 
Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Ser Lys Asn Phe 

420 425 430 

agt ggg ate ctg tee ctg gae ttc ctg ctg gag ctg ctg gag ett cag 1344 
Ser Gly He Leu Ser Leu Asp Phe Leu Leu Glu Leu Leu Glu Leu Gin 
435 440 445 

gag agg ett cga cac ctg caa gtg tgg tee cct gag gea gag ege aac 1392 
Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Glu Arg Asn 

450 455 460 

ate tee etc cag gae ate tge tat gcc ccc etc aac cea tat aac ace 1440 
He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Tyr Asn Thr 
465 470 475 480 

age etc tec gac tgc tgt gtc aac age etc ett cag tac ttc cag aac 1488 
Ser Leu Ser Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 490 495 

aac ege ace etc ctg atg etc acg gee aac cag act ctg aat ggc cag 1536 
Asn Arg Thr Leu Leu Met Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 

500 505 510 

acc tec ctg gtg gae tgg aag gac eat ttc etc tac tgt gca aat gee 1584 
Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 

515 520 525 

cct etc acg ttc aaa gat ggc acg tct ctg gcc ctg age tge atg get 1632 
Pro Leu Thr Phe Lys Asp Gly Thr Ser Leu Ala Leu Ser Cys Met Ala 
530 535 540 



gac tac ggg get cct gtc ttc ccc ttc ett get gtt ggg gga tac caa 
Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 



1680 
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545 

ggc acg gac 
Gly Thx Asp 



aat aac tac 
Asn Asn Tyr 



gag gag get 
Glu Glu Ala 
595 

gac aag ttc 
Asp Lys Pile 
610 

ate aae ege 
lie Asn Arg 
625 

att ate gtc 
lie lie Val 



tge age ega 
Cys Ser Arg 



gtg att gtt 
Val He Val 
675 

tae etg ggt 
Tyr Leu Gly 
690 

etg gtg eta 
Leu Val Leu 
705 

tae eag agg 

Tyr Gin Arg 



gge ege aee 
Gly Arg Thr 



tct gag gee 
Ser Glu Ala 
755 

gtg agg ace 
Val Arg Thr 
770 

etg ete eag 
Leu Leu Gin 
785 



550 

tat tee gag 
Tyr Ser Glu 
565 

ccc get gat 
Pro Ala Asp 
580 

ttc ttg aag 

Phe Leu Lys 



eag gtt gcg 
Gin Val Ala 



aee aee ate 
Thr Thr He 
630 

ttc etg tac 
Phe Leu Tyr 
645 

gta gcg gtg 
Val Ala Val 
660 

gtg etg gga 
Val Leu Gly 



gtc ccc tct 
Val Pro Ser 



get gtg gga 
Ala Val Gly 
710 

eta cet agg 
Leu Pro Arg 
725 

etg ggc agt 
Leu Gly Ser 

740 

ate tgc ttc 
He Cys Phe 



ttc gee ttg 
Phe Ala Leu 



atg act gee 
Met Thr Ala 
790 



gca gaa gcg 
Ala Glu Ala 



gat ccc cgc 
Asp Pro Arg 
585 

gaa atg gaa 
Glu Met Glu 
600 

ttc tea get 
Phe Ser Ala 
615 

eag gac etg 
Gin Asp Leu 



ate tec etg 
He Ser Leu 



gag tec aag 
Glu Ser Lys 
665 

gca gtt etg 
Ala Val Leu 
680 

tct etg gtt 
Ser Leu Val 
695 

get gac aac 
Ala Asp Asn 



atg cet ggg 
Met Pro Gly 



gtg gee eee 
Val Ala Pro 

745 

ttt eta ggg 
Phe Leu Gly 
760 

ace tct ggc 
Thr Ser Gly 
775 

ttt gtg gee 
Phe Val Ala 



4/73 
555 

etg ate ata 
Leu He He 
570 

atg gee cag 
Met Ala Gin 



tee ttc cag 

Ser Phe Gin 



gag ege tct 
Glu Arg Ser 
620 

cet gtc ttt 
Pro Val Phe 
635 

gee etg ggc 
Ala Leu Gly 
650 

get act etg 
Ala Thr Leu 



get gee atg 
Ala Ala Met 



ate ate caa 
He He Gin 
700 

ate ttc ate 
He Phe He 
715 

gaa eag ega 

Glu Gin Arg 
730 

age atg etg 
Ser Met Leu 



gee etg ace 
Ala Leu Thr 



tta gca att 
Leu Ala He 
780 

etg ete tee 

Leu Leu Ser 
795 



ace ttc tct 
Thr Phe Ser 
575 

gee aag etc 
Ala Lys Leu 
590 

agg aac aca 

Arg Asn Thr 
605 

etg gag ' gat 
Leu Glu Asp 



gee gtc age 
Ala Val Ser 



age tac tec 
Ser Tyr Ser 
655 

gge eta ggt 
Gly Leu Gly 
670 

gge ttc tac 
Gly Phe Tyr 

685 

gtg gta cet 
Val Val Pro 



ttt gtt ett 
Phe Val Leu 



gag get cac 
Glu Ala His 
735 

etg tge age 
Leu Cys Ser 
750 

ccc atg eca 
Pro Met Pro 
765 

ate etc gac 
He Leu Asp 



etg gat age 
Leu Asp Ser 



560 

etc 1728 
Leu 



tgg 1776 
Trp 



agt 1824 
Ser 



gag 1872 
Glu 



tae 1920 

Tyr 

640 

aga 1968 
Arg 



ggg 2016 
Gly 



tee 2064 
Ser 



ttc 2112 
Phe 



gag 2160 

Glu 

720 

att 2208 
He 



ete 2256 
Leu 



get 2304 
Ala 



ttc 2352 
Phe 



aag 2400 

Lys 
800 
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agg cag gag gcc tct cgc ccg gat gtc tta tgc tgc ttt tea acc egg 2448 
Arg Gin Glu Ala Ser Arg Pro Asp Val Leu Cys Cys Phe Ser Thr Arg 

805 810 815 

aag ctg ccc cca cct aaa gaa aaa gaa ggc etc tta ete ege tte ttc 2496 
Lys Leu Pro Pro Pro Lys Glu Lys Glu Gly Leu Leu Leu Arg Phe Phe 

820 825 830 

ego aag ata tac get cct ttc ctg ctg cac aga ttc ate cgc cct gtt 2544 
Arg Lys He Tyr Ala Pro Phe Leu. Leu His Arg Phe He Arg Pro Val 

835 840 845 

gtg atg ctg ctg ttt ctg acc ctg ttt gga gea aat etc tac tta atg 2592 
Val Met Leu Leu Phe Leu Thr Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 

tge aae ate aac gtg ggg eta gae eag gag etg get etg eee aag gae 2640 
Cys Asn He Asn Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 880 

teg tac ttg ata gae tac ttc etc ttt ctg aac ega tac ett gaa gtg 2 688 

Ser Tyr Leu He Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 

885 890 895 

ggg cct cca gtg tac ttt gtc acc acc teg ggc ttc aac ttc tec age 2736 
Gly Pro Pro Val Tyr Phe Val Thr Thr Ser Gly Phe Asn Phe Ser Ser 

900 905 910 

gag gea ggc atg aac gcc act tgc tct age gea ggc tgt aag age ttc 2784 
Glu Ala Gly Met Asn Ala Thr Cys Ser Ser Ala Gly Cys Lys Ser Phe 
915 920 925 

tee eta aee eag aaa ate eag tat gee agt gaa tte eet gae eag tet 2832 
Ser Leu Thr Gin Lys He Gin Tyr Ala Ser Glu Phe Pro Asp Gin Ser 
930 935 940 

tac gtg get att get gea tec tec tgg gta gat gae tte ate gae tgg 288 0 

Tyr Val Ala He Ala Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 

ctg acc ccg tec tec tec tgc tgt cgc ett tat ata cgt ggc ccc cat 2928 
Leu Thr Pro Ser Ser Ser Cys Cys Arg Leu Tyr He Arg Gly Pro His 

965 970 975 

aag gat gag tte tgt ccc tea acg gat act tec ttc aac tgc tta aaa 2976 
Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 

980 985 990 

aae tge atg aac ege aet etg ggt eet gtg agg eee aea geg gaa eag 3024 
Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Ala Glu Gin 
995 1000 1005 

ttt cat aag tac ctg ccc tgg tte etg aat gat eeg eee aat ate 3069 
Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Pro Pro Asn He 
1010 1015 1020 

aga tgt ccc aaa ggg ggt eta gea geg tat aga acg tct gtg aat 3114 
Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 
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ttg age tea gat ggc cag gtt ata gcc tec cag ttc atg gcc tac 3159 
Leu Ser Ser Asp Gly Gin Val lie Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 

cac aag ccc tta agg aac tea eag gae tte aea gaa get ete egg 3204 
His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 

geg tee egg ttg eta gea gee aae ate aea get gae eta egg aag 3249 
Ala Ser Arg Leu Leu Ala Ala Asn lie Thr Ala Asp Leu Arg Lys 
1070 1075 1080 

gtg cct ggg aca gat cca aac ttt gag gtc ttc cct tac acg ate 3294 
Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
1085 1090 1095 

tec aac gtg ttc tac cag caa tac ctg acg gte ett eet gag gga 3339 
Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 

ate tte aee ett get ett tge ttt gtg eee aee ttt gtt gte tge 3384 
He Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 

tae ete eta etg gge etg gae atg tge tea ggg ate ete aae eta 3429 
Tyr Leu Leu Leu Gly Leu Asp Met Cys Ser Gly He Leu Asn Leu 
1130 1135 1140 

ete tec ate att atg att etc gtg gae acc att ggc etc atg get 3474 
Leu Ser He He Met He Leu Val Asp Thr He Gly Leu Met Ala 
1145 1150 1155 

gtg tgg ggt ate age tat aat gcg gta tee etc ate aae ett gte 3519 
Val Trp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 

aeg gea gtg gge atg tet gtg gag ttt gtg tee eae ate act egg 3564 
Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 

tee ttt get gta age aee aag eet aee egg etg gag agg get aaa 3609 
Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 1200 

gat get act gtc ttc atg ggc agt gcg gtg ttt get gga gtg gcc 3654 
Asp Ala Thr Val Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 
1205 1210 1215 

atg acc aac ttc cca ggc ate etc ate ttg gge ttt gee caa gee 3699 
Met Thr Asn Phe Pro Gly He Leu He Leu Gly Phe Ala Gin Ala 
1220 1225 1230 

eag ett att eag ate ttc ttc tte ege etc aac ett etg ate aee 3744 
Gin Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 1245 

ttg ctg ggt ctg ctg cat gge etg gte ttc ctg ceg gtt gte etc 3789 
Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 



age tat etg gga cca gat gtt aac caa get ctg gta cag gag gag 



3834 
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Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Gin Glu Glu 
1265 1270 1275 

aaa eta gcc age gag gca gca gtg gcc cca gag cct tct tgc cca 3879 

Lys Leu Ala Ser* Glu Ala Ala Val Ala Pro Glu Pro Ser Cys Pro 

1280 1285 1290 

cag tac ccc tec cct get gat gcg gat gcc aat gtt aac tac ggc 3924 

Gin Tyr Pro Ser Pro Ala Asp Ala Asp Ala Asn Val Asn Tyr Gly 
1295 1300 1305 

ttt gcc cca gaa ctt gcc cac gga get aat get get aga age tct 3969 

Phe Ala Pro Glu Leu Ala His Gly Ala Asn Ala Ala Arg Ser Ser 
1310 1315 1320 

ttg ccc aaa agt gac caa aag ttc taa 3996 

Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 



<210> 2 

<211> 1331 

<212> PRT 

<213> Rattus sp 



<400> 2 

Met Ala Ala Ala Trp Leu Gly Trp Leu Leu Trp Ala Leu Leu Leu Ser 
15 10 15 



Ala Ala Gin Gly Glu Leu Tyr Thr Pro Lys His Glu Ala Gly Val Cys 

20 25 30 



Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 40 45 



Thr Ser Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 



Val Thr Gly Glu His Leu Ala Leu Leu Gin Arg lie Cys Pro Arg Leu 
65 70 75 80 



Tyr Asn Gly Pro Asn Thr Thr Phe Ala Cys Cys Ser Thr Lys Gin Leu 

85 90 95 



Leu Ser Leu Glu Ser Ser Met Ser lie Ihr Lys Ala Leu Leu Thr Arg 

100 105 110 



Cys Pro Ala Cys Ser Asp Asn Phe Val Ser Leu His Cys His Asn Thr 
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115 



Cys Ser Pro Asp 
130 



Arg Gly Ala Gly 
145 



Gin Arg Ser Phe 



lie Pro Ala Ala 

180 



Gly Ser Ala Leu 
195 



Thr Gly Asn Gly 
210 



Pro Gly Gin Ala 
225 



Ala Pro Cys Asn 



Gin Asp Cys Ala 

260 



Arg Pro Ser Phe 
275 



lie lie Phe Thr 
290 



Leu Arg Val Val 

305 



Glu Ala Pro Lys 



Leu Gly Arg Phe 

340 



120 



Gin Ser Leu Phe 
135 



Glu Pro Pro Ala 
150 



Ala Glu Lys Ala 
165 



Ala Ser Leu Ala 



Cys Asn Ala Gin 

200 



Leu Ala Pro Leu 
215 



Leu Pro Asp Gly 
230 



Glu Ser Gin Gly 

245 



Ala Ser Cys Pro 



Tyr Met Gly Arg 

280 



Ala Val Phe Val 
295 



Ser Asn Arg Asn 
310 



Leu Pro His Lys 
325 



Phe Gin Asn Trp 
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lie Asn Val Thr 

140 



Val Val Ala Tyr 
155 



Tyr Glu Ser Cys 
170 



Val Gly Ser Met 
185 



Arg Trp Leu Asn 



Asp He Thr Phe 

220 



He Gin Pro Leu 
235 



Asp Asp Ser Ala 
250 



Val He Pro Pro 
265 



Met Pro Gly Trp 



Leu Leu Ser Ala 

300 



Lys Asn Lys Ala 
315 



His Lys Leu Ser 
330 



Gly Thr Arg Val 
345 



125 



Arg val Val Glu 



Glu Ala Phe Tyr 

160 



Ser Gin Val Arg 
175 



Cys Gly Val Tyr 
190 



Phe Gin Gly Asp 
205 



His Leu Leu Glu 



Asn Gly Lys He 

240 



Val Cys Ser Cys 
255 



Pro Glu Ala Leu 
270 



Leu Ala Leu He 
285 



Val Leu Val Arg 



Glu Gly Pro Gin 

320 



Pro His Thr He 
335 



Ala Ser Trp Pro 
350 



Leu Thr Val Leu Ala Leu Ser Phe He Val Val He Ala Leu Ala Ala 
355 360 365 
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Gly Leu Thr Phe lie Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 



Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ser Phe His Asp Glu His 
385 390 395 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin He Phe Val Thr Ala Arg Asn 

405 410 415 



Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Ser Lys Asn Phe 

420 425 430 



Ser Gly He Leu Ser Leu Asp Phe Leu Leu Glu Leu Leu Glu Leu Gin 
435 440 445 



Glu Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Glu Arg Asn 
450 455 460 



He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Tyr Asn Thr 
465 470 475 480 



Ser Leu Ser Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 490 495 



Asn Arg Thr Leu Leu Met Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 

500 505 510 



Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 



Pro Leu Thr Phe Lys Asp Gly Thr Ser Leu Ala Leu Ser Cys Met Ala 
530 535 540 



Asp Tyr Gly Ala Pro Val Phe Pro Phe Leu Ala Val Gly Gly Tyr Gin 
545 550 555 560 



Gly Thr Asp Tyr Ser Glu Ala Glu Ala Leu He He Thr Phe Ser Leu 

565 570 575 



Asn Asn Tyr Pro Ala Asp Asp Pro Arg Met Ala Gin Ala Lys Leu Trp 

580 585 590 



Glu Glu Ala Phe Leu Lys Glu Met Glu Ser Phe Gin Arg Asn Thr Ser 
595 600 605 
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Asp Lys Phe Gin Val Ala Phe Ser Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 



lie Asn Arg Thr Thr lie Gin Asp Leu Pro Val Phe Ala Val Ser Tyr 
625 630 635 640 



lie lie Val Phe Leu Tyr lie Ser Leu Ala Leu Gly Ser Tyr Ser Arg 

645 650 655 



Cys Ser Arg Val Ala Val Glu Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 



Val lie Val Val Leu Gly Ala Val Leu Ala Ala Met Gly Phe Tyr Ser 
675 680 685 



Tyr Leu Gly Val Pro Ser Ser Leu Val lie lie Gin Val Val Pro Phe 
690 695 700 



Leu Val Leu Ala Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 



Tyr Gin Arg Leu Pro Arg Met Pro Gly Glu Gin Arg Glu Ala His He 

725 730 735 



Gly Arg Thr Leu Gly Ser Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 745 750 



Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 



Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala He He Leu Asp Phe 
770 775 780 



Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 



Arg Gin Glu Ala Ser Arg Pro Asp Val Leu Cys Cys Phe Ser Thr Arg 

805 810 815 



Lys Leu Pro Pro Pro Lys Glu Lys Glu Gly Leu Leu Leu Arg Phe Phe 

820 825 830 



Arg Lys He Tyr Ala Pro Phe Leu Leu His Arg Phe He Arg Pro Val 
835 840 845 
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Val Met Leu Leu Phe Leu Thr Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 

Cys Asn lie Asn Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
B65 870 875 880 

Ser Tyr Leu lie Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 

885 890 895 

Gly Pro Pro Val Tyr Phe Val Thr Thr Ser Gly Phe Asn Phe Ser Ser 

900 905 910 

Glu Ala Gly Met Asn Ala Thr Cys Ser Ser Ala Gly Cys Lys Ser Phe 
915 920 925 

Ser Leu Thr Gin Lys lie Gin Tyr Ala Ser Glu Phe Pro Asp Gin Ser 
930 935 940 

Tyr Val Ala lie Ala Ala Ser Ser Trp Val Asp Asp Phe lie Asp Trp 
945 950 955 960 

Leu Thr Pro Ser Ser Ser Cys Cys Arg Leu Tyr lie Arg Gly Pro His 

965 970 975 

Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 

980 985 990 

Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Ala Glu Gin 
995 1000 1005 

Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Pro Pro Asn lie 
1010 1015 1020 



Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 



Leu Ser Ser Asp Gly Gin Val He Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 



His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 



Ala Ser Arg Leu Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys 
1070 1075 1080 



Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
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1085 1090 1095 



Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 



lie Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 



Tyr Leu Leu Leu Gly Leu Asp Met Cys Ser Gly lie Leu Asn Leu 
1130 1135 1140 



Leu Ser He He Met He Leu Val Asp Thr He Gly Leu Met Ala 

1145 1150 1155 



Val Txp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 



Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 



Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 1200 



Asp Ala Thr Val Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 
1205 1210 1215 



Met Thr Asn Phe Pro Gly He Leu He Leu Gly Phe Ala Gin Ala 
1220 1225 1230 



Gin Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 1245 



Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 



Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Gin Glu Glu 
1265 1270 1275 



Lys Leu Ala Ser Glu Ala Ala Val Ala Pro Glu Pro Ser Cys Pro 
1280 1285 1290 



Gin Tyr Pro Ser Pro Ala Asp Ala Asp Ala Asn Val Asn Tyr Gly 
1295 1300 1305 



Phe Ala Pro Glu Leu Ala His Gly Ala Asn Ala Ala Arg Ser Ser 
1310 1315 1320 
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Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 

<210> 3 

<211> 3999 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (3999) 

<223> 



<400> 3 
atg gcg gag 
Met Ala Glu 
1 

cgc ttg gcc 
Arg Leu Ala 



tgc gcc ttc 
Cys Ala Phe 

35 

etc atg aca 
Leu Met Thr 
50 

aag ate aca 
Lys lie Thr 
65 

etc tac ace 

Leu Tyr Thr 



gta tea ctg 
Val Ser Leu 



tgc cca gcc 
Cys Pro Ala 
115 

tgc age cec 
Cys Ser Pro 



gee gge ctg 
Ala Gly Leu 
5 

cag agt gag 
Gin Ser Glu 
20 

tat gae gaa 
Tyr Asp Glu 



etc tec aac 
Leu Ser Asn 



ggt gat eae 
Gly Asp His 
70 

gge cec aac 
Gly Pro Asn 
85 

gaa gcg agt 
Glu Ala Ser 
100 

tgc tet gae 
Cys Ser Asp 



aat eag age 
Asn Gin Ser 



agg gge tgg 
Arg Gly Trp 



cet tac aca 

Pro Tyr Thr 
25 

tgt ggg aag 

Cys Gly Lys 
40 

gtg tee tgc 
Val Ser Cys 
55 

etg ate eta 
Leu lie Leu 



ace caa gcc 

Thr Gin Ala 



ctg teg ate 
Leu Ser lie 
105 

aat ttt gtg 
Asn Phe Val 
120 

etc ttc ate 
Leu Phe lie 



etg etg tgg 
Leu Leu Trp 
10 

ace ate cac 
Thr lie His 



aac cca gag 
Asn Pro Glu 



etg tec aac 
Leu Ser Asn 
60 

tta eag aag 
Leu Gin Lys 
75 

tgc tgc tec 
Cys Cys Ser 
90 

acc aag gcc 
Thr Lys Ala 



aac ctg cac 
Asn Leu His 



aat gtg ace 
Asn Val Thr 



gee etg etc 
Ala Leu Leu 
15 

cag cet gge 
Gin Pro Gly 
30 

ctg tct gga 
Leu Ser Gly 
45 

acg ccg gcc 
Thr Pro Ala 



ate tge eee 
lie Cys Pro 



gcc aag cag 

Ala Lys Gin 
95 

etc etc acc 
Leu Leu Thr 
110 

tgc cac aac 
Cys His Asn 
125 

ege gtg gee 
Arg Val Ala 



etg 48 
Leu 



tac 96 
Tyr 



age 144 
Ser 



cgc 192 
Arg 



ege 240 

Arg 

80 

etg 288 
Leu 



cgc 336 
Arg 



acg 384 
Thr 



eag 432 
Gin 
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130 135 140 

eta ggg get gga caa etc cca get gtg gtg gcc tat gag gcc ttc tac 480 
Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

cag cat age ttt gcc gag cag age tat gac tec tge age cgt gtg cge 528 
Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 

165 170 175 

gte cct gca get gcc aeg etg get gtg ggc acc atg tgt ggc gtg tat 576 
Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 

180 185 190 

ggc tot gcc ctt tgc aat gcc cag cgc tgg etc aac ttc cag gga gac 624 
Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

aca ggc aat ggt etg gee cca etg gac ate ace ttc cae etc ttg gag 672 
Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 

cct ggc cag gcc gtg ggg agt ggg att cag cct etg aat gag ggg gtt 720 
Pro Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 

gca cgt tgc aat gag tec caa ggt gac gac gtg geg acc tge tec tgc 768 
Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 

245 250 255 

caa gac tgt get gca tec tgt cct gcc ata gcc cgc ccc cag gcc etc 816 
Gin Asp Cys Ala Ala Ser Cys Pro Ala He Ala Arg Pro Gin Ala Leu 

260 265 270 

gac tec ace ttc tac etg ggc cag atg ecg ggc agt etg gte etc ate 864 
Asp Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu He 
275 280 285 

ate ate etc tgc tet gte ttc get gtg gte ace ate etg ctt gtg gga 912 
He He Leu Cys Ser Val Phe Ala Val Val Thr He Leu Leu Val Gly 
290 295 300 

ttc cgt gtg gcc ccc gcc agg gac aaa age aag atg gtg gac ccc aag 960 
Phe Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys 
305 310 315 320 

aag ggc acc age etc tet gac aag etc age ttc tec acc cae acc etc 1008 
Lys Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu 

325 330 335 

ctt ggc cag ttc ttc cag ggc tgg ggc aeg tgg gtg get teg tgg cct 1056 
Leu Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro 

340 345 350 

etg acc ate ttg gtg eta tet gte ate ceg gtg gtg gee ttg gca geg 1104 
Leu Thr He Leu Val Leu Ser Val He Pro Val Val Ala Leu Ala Ala 
355 360 365 



ggc etg gte ttt aca gaa etc act aeg gac ccc gtg gag etg tgg teg 1152 
Gly Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 
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gcc ccc aac 
Ala Pro Asn 
385 

ttc ggc ccc 
Phe Gly Pro 



egg tec age 
Arg Ser Ser 



age gga ate 
Ser Gly lie 
435 

gag agg ctg 
Glu Arg Leu 
450 

ate tec ctg 
lie Ser Leu 
465 

agt etc tae 
Ser Leu Tyr 



aae ege aeg 
Asn Arg Thr 



aee tee eaa 
Thr Ser Gin 
515 

eeg ete aee 
Pro Leu Thr 
530 

gae tae ggg 
Asp Tyr Gly 

545 

gga aag gae 
Gly Lys Asp 



aac aat tae 
Asn Asn Tyr 



gag gag gcc 
Glu Glu Ala 
595 

ggc atg ttc 
Gly Met Phe 
610 



age caa gcc 
Ser Gin Ala 
390 

ttc ttc cga 
Phe Phe Arg 
405 

tae agg tat 
Tyr Arg Tyr 
42 0 

ctg gae ctg 
Leu Asp Leu 



egg cac etc 
Arg His Leu 



cag gae ate 
Gin Asp lie 
470 

gae tge tge 
Asp Cys Cys 
485 

etc ctg ctg 
Leu Leu Leu 
500 

gtc gae tgg 
Val Asp Trp 



ttc aag gat 
Phe Lys Asp 



gcc cet gtc 
Ala Pro Val 
550 

tat tet gag 
Tyr Ser Glu 
565 

ect gcc ggg 
Pro Ala Gly 
580 

ttc tta gag 
Phe Leu Glu 



cag gtc aeg 
Gin Val Thr 



egg agt gag 
Arg Ser Glu 



aee aac cag 
Thr Asn Gin 



gae tet ctg 
Asp Ser Leu 
425 

gae ttg ctg 
Asp Leu Leu 
440 

cag gta tgg 
Gin Val Trp 
455 

tge tae gee 
Cys Tyr Ala 



ate aae age 
He Asn Ser 



etc aca gcc 
Leu Thr Ala 
505 

aag gae cat 
Lys Asp His 
520 

ggc aca gee 
Gly Thr Ala 
535 

ttc ccc ttc 
Phe Pro Phe 



gea gag gcc 
Ala Glu Ala 



gae ccc cgt 
Asp Pro Arg 
585 

gaa atg cga 
Glu Met Arg 
600 

ttc aeg get 
Phe Thr Ala 

615 



aaa get ttc 
Lys Ala Phe 
395 

gtg ate ctg 
Val He Leu 
410 

ctg ctg ggg 
Leu Leu Gly 



ctg gag ctg 
Leu Glu Leu 



teg ccc gaa 
Ser Pro Glu 
460 

ccc ete aat 

Pro Leu Asn 
475 

ete ctg cag 
Leu Leu Gin 

490 

aac cag aca 
Asn Gin Thr 



ttt ctg tac 
Phe Leu Tyr 



ctg gee ctg 
Leu Ala Leu 
540 

ett gee att 
Leu Ala He 

555 

ctg ate atg 
Leu He Met 
570 

ctg gcc cag 
Leu Ala Gin 



gee ttc cag 
Ala Phe Gin 



gag ege tet 
Glu Arg Ser 

620 



cat gae cag 
His Asp Gin 



aeg get cet 
Thr Ala Pro 
415 

ccc aag aae 
Pro Lys Asn 
43 0 

eta gag ctg 
Leu Glu Leu 
445 

gea cag ege 
Ala Gin Arg 



eeg gae aat 
Pro Asp Asn 



tat ttc cag 
Tyr Phe Gin 
495 

ctg atg ggg 
Leu Met Gly 
510 

tgt gcc aat 
Cys Ala Asn 
525 

age tge atg 
Ser Cys Met 



ggg ggg tae 

Gly Gly Ty^ 



aeg ttc tec 
Thr Phe Ser 
575 

gcc aag ctg 
Ala Lys Leu 
590 

cgt egg atg 
Arg Arg Met 
605 

ctg gaa gae 
Leu Glu Asp 



cat 1200 

His 

400 

aae 1248 
Asn 



ttc 1296 
Phe 



cag 1344 
Gin 



aac 1392 
Asn 



aee 1440 

Thr 
480 

aae 1488 
Asn 



cag 1536 
Gin 



gcc 1584 
Ala 



get 1632 
Ala 



aaa 1680 

Lys 

560 

etc 1728 
Leu 



tgg 1776 
Trp 



get 1824 
Ala 



gag 1872 
Glu 
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ate aat cgc 
lie Asn Arg 
625 

att gtc ata 
lie Val lie 



tgg age cga 
Trp Ser Arg 



gtg gcc gtg 
Val Ala Val 
675 

tac ttg ggt 
Tyr Leu Gly 
690 

ctg gtg ctg 

Leu Val Leu 
705 

tac cag agg 
Tyr Gin Arg 



ggg cga gcc 
Gly Arg Ala 



tct gag gcc 
Ser Glu Ala 
755 

gtg egg acc 
Val Arg Thr 
770 

etc ctg cag 
Leu Leu Gin 
785 

agg cag gag 
Arg Gin Glu 



gag ctg ccc 
Glu Leu Pro 



caa aag get 
Gin Lys Ala 
835 

gtg ctg ctg 
Val Leu Leu 
850 

tgc cac ate 



acc aca get 
Thr Thr Ala 
630 

ttc ctg tac 
Phe Leu Tyr 

645 

gtg atg gtg 
Val Met Val 
660 

gtc ctg gga 
Val Leu Gly 



ate cgc tec 
lie Arg Ser 



tec gtg ggg 

Ser Val Gly 
710 

ctg ccc egg 
Leu Pro Arg 
725 

eta ggc agg 
Leu Gly Arg 
740 

ate tgc ttc 
lie Cys Phe 



ttt gee ctg 
Phe Ala Leu 



atg tea gee 
Met Ser Ala 
790 

gee tec egg 
Ala Ser Arg 
805 

ceg cet ggc 
Pro Pro Gly 
820 

tat gee ccc 
Tyr Ala Pro 



ctg ttt etc 
Leu Phe Leu 



age gtg gga 



gaa gae ctg 
Glu Asp Leu 



ate tct ctg 
lie Ser Leu 



gae tee aag 
Asp Ser Lys 
665 

gea gtc atg 
Ala Val Met 
680 

tee ctg gtc 
Ser Leu Val 
695 

get gat aac 

Ala Asp Asn 



agg cet ggg 
Arg Pro Gly 



gtg get ccc 
Val Ala Pro 
745 

ttc eta ggg 
Phe Leu Gly 
760 

ace tct ggc 
Thr Ser Gly 
775 

ttt gtg gee 
Phe Val Ala 



ttg gae gtc 
Leu Asp Val 



cag gga gag 
Gin Gly Glu 
825 

ttc ctg ctg 
Phe Leu Leu 
840 

gee ctg ttc 
Ala Leu Phe 
855 

ctg gae cag 



ccc ate ttt 
Pro lie Phe 
635 

gee ctg ggc 
Ala Leu Gly 
650 

gee aeg ctg 
Ala Thr Leu 



get gee atg 
Ala Ala Met 



ate ctg caa 
lie Leu Gin 
700 

ate ttc ate 
lie Phe lie 
715 

gag cea cga 
Glu Pro Arg 
730 

age atg ctg 
Ser Met Leu 



gee ctg ace 
Ala Leu Thr 



ett gea gtg 
Leu Ala Val 
780 

ctg etc tee 
Leu Leu Ser 
795 

tgc tgc tgt 
Cys Cys Cys 
810 

ggg etc ctg 
Gly Leu Leu 



cac tgg ate 
His Trp lie 



gga gtg age 
Gly Val Ser 
860 

gag ctg gcc 



gee ace age 
Ala Thr Ser 



age tat tee 
Ser Tyr Ser 
655 

ggc etc ggc 
Gly Leu Gly 
670 

ggc ttc ttc 
Gly Phe Phe 
685 

gtg gtt cet 
Val Val Pro 



ttt gtt etc 
Phe Val Leu 



gag gtc cac 
Glu Val His 
735 

ttg tgc age 
Leu Cys Ser 
750 

ccc atg cea 
Pro Met Pro 
765 

7 
» 

ate ett gae 
lie Leu Asp 



ctg gae age 
Leu Asp Ser 



gtc aag ccc 
Val Lys Pro 
815 

ett ggc ttc 
Leu Gly Phe 
830 

act cga ggt 
Thr Arg Gly 
845 

etc tac tec 
Leu Tyr Ser 



ctg ccc aag 



tac 1920 

Tyr 

640 

age 1968 
Ser 



ggg 2016 
Gly 



tec 2064 
Ser 



ttc 2112 
Phe 



gag 2160 

Glu 
720 

att 2208 
He 



etc 2256 
Leu 



get 2304 
Ala 



ttc 2352 
Phe 



aag 2400 

Lys 

800 

cag 2448 
Gin 



ttc 2496 
Phe 



gtt 2544 

Val 



atg 2592 
Met 



gae 2 640 
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Cys His lie Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 880 

teg tac ctg ctt gac tat ttc etc ttt ctg aac cgc tac ttc gag gtg 2 688 

Ser Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val 

885 890 895 

ggg gcc ccg gtg tac ttt gtt acc acc ttg ggc tac aac ttc tec age 2736 
Gly Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser 

900 905 910 

gag get ggg atg aat gee ate tge tec agt gea ggc tgc aac aac ttc 2784 
Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe 
915 920 925 

tec ttc acc cag aag ate eag tat gee aca gag ttc ect gag eag tet 2832 
Ser Phe Thr Gin Lys He Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser 
930 935 940 

tac ctg gee ate ect gee tee tec tgg gtg gat gac ttc att gac tgg 2880 
Tyr Le\i Ala He Pro Ala Ser Ser Trp Val Asp Asp Phe He Asp Torp 
945 950 955 960 

ctg acc ccg tec tee tge tge cgc ctt tat ata tet ggc cec aat aag 2928 
Leu Thr Pro Ser Ser Cys Cys Arg Leu Tyr He Ser Gly Pro Asn Lys 

965 970 975 

gac aag ttc tgc cec teg acc gtc aac tet ctg aac tgc eta aag aac 2976 
Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 

980 985 990 

tgc atg age ate aeg atg ggc tet gtg agg cec teg gtg gag cag ttc 3024 
Cys Met Ser He Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 1005 

eat aag tat ctt cec tgg ttc ctg aac gac egg cec aac ate aaa 3069 
His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn He Lys 
1010 1015 1020 

tgt cec aaa ggc ggc ctg gea gea tac age ace tet gtg aac ttg 3114 
Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 

act tea gat ggc cag gtt tta gee tec agg ttc atg gee tat cac 3159 
Thr Ser Asp Gly Gin Val Leu Ala Ser Arg Phe Met Ala Tyr His 
1040 1045 1050 

aag cec ctg aaa aac tea eag gat tac aca gaa get ctg egg gea 3204 
Lys Pro Leu Lys Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala 
1055 1060 1065 

get cga gag ctg gea gee aac ate act get gac ctg egg aaa gtg 3249 
Ala Arg Glu Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val 
1070 1075 1080 

ect gga aca gac ccg get ttt gag gtc ttc cec tac aeg ate ace 3294 
Pro Gly Thr Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr 
1085 1090 1095 



aat gtg ttt tat gag cag tac ctg acc ate etc ect gag ggg etc 
Asn Val Phe Tyr Glu Gin Tyr Leu Thr He Leu Pro Glu Gly Leu 



3339 
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1100 



1105 



1110 



ttc atg etc age etc tgc ctt gtg ccc acc ttc get gtc tec tgc 
Phe Met Leu Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys 
1115 1120 1125 



3384 



etc etg 
Leu Leu 
1130 



ctg ggc etg gae 
Leu Gly Leu Asp 



ctg 
Leu 
1135 



egc 
Arg 



tec 
Ser 



gge 

Gly 



etc 
Leu 



etc 
Leu 
1140 



aac ctg etc 
Asn Leu Leu 



3429 



tec att gtc atg ate etc gtg gae act gtc ggc ttc atg gee ctg 
Ser lie Val Met lie Leu Val Asp Thr Val Gly Phe Met Ala Leu 
1145 1150 1155 



3474 



tgg gae ate agt tac aat get gtg tee etc ate aac etg gtc teg 
Trp Asp lie Ser Tyr Asn Ala Val Ser Leu lie Asn Leu Val Ser 
1160 1165 1170 



3519 



geg gtg ggc atg tet gtg gag ttt gtg tec cac att ace egc tec 
Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg Ser 
1175 1180 1185 



3564 



ttt gcc ate age acc aag ccc acc tgg ctg gag agg gee aaa gag 
Phe Ala He Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu 
1190 1195 1200 



3609 



gee ace ate tct atg gga agt geg gtg ttt gea ggt gtg gee atg 
Ala Thr He Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met 
1205 1210 1215 



3654 



ace aac etg ect gge ate ctt gtc etg gge etc gee aag gee eag 
Thr Asn Leu Pro Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin 
1220 1225 1230 



3699 



etc att eag ate ttc ttc ttc egc etc aac etc ctg ate act etg 
Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu 
1235 1240 1245 



3744 



ctg gge etg etg eat gge ttg gtc ttc etg ccc gtc ate etc age 
Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val He Leu Ser 
1250 1255 1260 



3789 



tac gtg ggg cct gae gtt aac eeg get etg gea etg gag eag aag 
Tyr Val Gly Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys 
1265 1270 1275 



3834 



egg get gag gag geg gtg gea gea gtc atg gtg gee tet tgc cea 
Arg Ala Glu Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro 
1280 1285 1290 



3879 



aat cac ccc tec ega gtc tec aca get gae aac ate tat gtc aac 
Asn His Pro Ser Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn 
1295 1300 1305 



3924 



cac age ttt gaa ggt tet ate aaa ggt get ggt gee ate age aac 
His Ser Phe Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn 
1310 1315 1320 



3969 



ttc ttg ccc aac aat ggg egg eag ttc tga 3999 
Phe Leu Pro Asn Asn Gly Arg Gin Phe 
1325 1330 
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<210> 4 

<211> 1332 

<212> PRT 

<213> Homo sapiens 



<400> 4 

Met Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu 
15 10 15 



Arg Leu Ala Gin Ser Glu Pro Tyr Thr Tlir lie His Gin Pro Gly Tyr 

20 25 30 



Cys Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser 
35 40 45 



Leu Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg 
50 55 60 



Lys lie Thr Gly Asp His Leu lie Leu Leu Gin Lys He Cys Pro Arg 
65 70 75 80 



Leu Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu 

85 90 95 



Val Ser Leu Glu Ala Ser Leu Ser He Thr Lys Ala Leu Leu Thr Arg 

100 105 110 



Cys Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asn Gin Ser Leu Phe He Asn Val Thr Arg Val Ala Gin 
130 135 140 



Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 



Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 

165 170 175 



Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 

180 185 190 
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Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 



Tlir Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 

210 215 220 



Pro Gly Gin Ala Val Gly Ser Gly lie Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 



Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 

245 250 255 



Gin Asp Cys Ala Ala Ser Cys Pro Ala lie Ala Arg Pro Gin Ala Leu 

260 265 270 



Asp Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu lie 
275 280 285 



lie He Leu Cys Ser Val Phe Ala Val Val Thr He Leu Leu Val Gly 
290 295 300 



Phe Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys 
305 310 315 320 



Lys Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu 

325 330 335 



Leu Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro 

340 345 350 



Leu Thr He Leu Val Leu Ser Val He Pro Val Val Ala Leu Ala Ala 
355 360 365 



Gly Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 



Ala Pro Asn Ser Gin Ala Arg Ser Glu Lys Ala Phe His Asp Gin His 
385 390 395 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin Val He Leu Thr Ala Pro Asn 

405 410 415 



Arg Ser Ser Tyr Arg Ty^r Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 

420 425 430 
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Ser Gly He lieu 
435 



Glu Arg Leu Arg 
450 



He Ser Leu Gin 
465 



Ser Leu Tyr Asp 



Asn Arg Thr Leu 

500 



Thr Ser Gin Val 
515 



Pro Leu Thr Phe 
530 



Asp Tyr Gly Ala 
545 



Gly Lys Asp Tyr 



Asn Asn Tyr Pro 

580 



Glu Glu Ala Phe 
595 



Gly Met Phe Gin 
610 



He Asn Arg Thr 

625 



He val He Phe 



Trp Ser Arg Val 

660 



Asp Leu Asp Leu 

440 



His Leu Gin Val 
455 



Asp He Cys Tyr 
470 



Cys Cys He Asn 
485 



Leu Leu Leu Thr 



Asp Trp Lys Asp 

520 



Lys Asp Gly Thr 
535 



Pro Val Phe Pro 
550 



Ser Glu Ala Glu 
565 



Ala Gly Asp Pro 



Leu Glu Glu Met 

600 



Val Thr Phe Thr 
615 



Thr Ala Glu Asp 
630 



Leu Tyr He Ser 
645 



Met Val Asp Ser 
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Leu Leu Glu Leu 



Trp Ser Pro Glu 

460 



Ala Pro Leu Asn 
475 



Ser Leu Leu Gin 
490 



Ala Asn Gin Thr 
505 



His Phe Leu Tyr 



Ala Leu Ala Leu 

540 



Phe Leu Ala He 
555 



Ala Leu He Met 
570 



Arg Leu Ala Gin 
585 



Arg Ala Phe Gin 



Ala Glu Arg Ser 

620 



Leu Pro He Phe 

635 



Leu Ala Leu Gly 
650 



Lys Ala Thr Leu 
665 



Leu Glu Leu Gin 
445 



Ala Gin Arg Asn 



Pro Asp Asn Thr 

480 



Tyr Phe Gin Asn 
495 



Leu Met Gly Gin 
510 



Cys Ala Asn Ala 
525 



Ser Cys Met Ala 



Gly Gly Tyr Lys 

560 



Thr Phe Ser Leu 
575 



Ala Lys Leu Trp 
590 



Arg Arg Met Ala 
605 



Leu Glu Asp Glu 



Ala Thr Ser Tyr 

640 



Ser Tyr Ser Ser 
655 



Gly Leu Gly Gly 
670 



Val Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser 
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675 



Tyr Leu Gly lie 
690 



Leu Val Leu Ser 
705 



Tyr Gin Arg Leu 



Gly Arg Ala Leu 

740 



Ser Glu Ala lie 
755 



Val Arg Thr Phe 
770 



Leu Leu Gin Met 
785 



Arg Gin Glu Ala 



Glu Leu Pro Pro 

820 



Gin Lys Ala Tyr 
835 



Val Leu Leu Leu 
850 



Cys His lie Ser 

865 



Ser Tyr Leu Leu 



Gly Ala Pro Val 

900 



680 



Arg Ser Ser Leu 
695 



Val Gly Ala Asp 
710 



Pro Arg Arg Pro 
725 



Gly Arg Val Ala 



Cys Phe Phe Leu 

760 



Ala Leu Thr Ser 
775 



Ser Ala Phe Val 
790 



Ser Arg Leu Asp 

805 



Pro Gly Gin Gly 



Ala Pro Phe Leu 

840 



Phe Leu Ala Leu 
855 



Val Gly Leu Asp 
870 



Asp Tyr Phe Leu 
885 



Tyr Phe Val Thr 
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Val He Leu Gin 

700 



Asn He. Phe He 
715 



Gly Glu Pro Arg 
730 



Pro Ser Met Leu 

745 



Gly Ala Leu Thr 



Gly Leu Ala Val 

780 



Ala Leu Leu Ser 
795 



Val Cys Cys Cys 
810 



Glu Gly Leu Leu 
825 



Leu His Trp He 



Phe Gly Val Ser 

860 



Gin Glu Leu Ala 
875 



Phe Leu Asn Arg 
890 



Thr Leu Gly Tyr 
905 



685 



Val Val Pro Phe 



Phe Val Leu Glu 

720 



Glu Val His He 
735 



Leu Cys Ser Leu 
750 



Pro Met Pro Ala 
765 



He Leu Asp Phe 



Leu Asp Ser Lys 

800 



Val Lys Pro Gin 

815 



Leu Gly Phe Phe 
830 



Thr Arg Gly Val 
845 



Leu Tyr Ser Met 



Leu Pro Lys Asp 

880 



Tyr Phe Glu Val 
895 



Asn Phe Ser Ser 
910 



Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe 
915 920 925 
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Ser Phe Thr Gin Lys lie Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser 
930 935 940 



Tyr Leu Ala lie Pro Ala Ser Ser Trp Val Asp Asp Phe lie Asp Trp 
945 950 955 960 



Leu Thr Pro Ser Ser Cys Cys Arg Leu Tyr lie Ser Gly Pro Asn Lys 

965 970 975 



Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 

980 985 990 



Cys Met Ser He Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 1005 



His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn He Lys 
1010 1015 1020 



Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 



Thr Ser Asp Gly Gin Val Leu Ala Ser Arg Phe Met Ala Tyr His 
1040 1045 1050 



Lys Pro Leu Lys Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala 
1055 1060 1065 



Ala Arg Glu Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val 
1070 1075 1080 



Pro Gly Thr Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr 
1085 1090 1095 



Asn Val Phe Tyr Glu Gin Tyr Leu Thr He Leu Pro Glu Gly Leu 
1100 1105 1110 



Phe Met Leu Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys 
1115 1120 1125 



Leu Leu Leu Gly Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu 
1130 1135 1140 



Ser He Val Met He Leu Val Asp Thr Val Gly Phe Met Ala Leu 
1145 1150 1155 
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Trp Asp lie Ser Tyr Asn Ala Val Ser Leu lie Asn Leu Val Ser 
1160 1165 1170 



Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg Ser 
1175 1180 1185 



Phe Ala He Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu 
1190 1195 1200 



Ala Thr He Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met 
1205 1210 1215 



Thr Asn Leu Pro Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin 
1220 1225 1230 



Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu 
1235 1240 1245 



Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val He Leu Ser 
1250 1255 1260 



Tyr Val Gly Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys 
1265 1270 1275 



Arg Ala Glu Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro 
1280 1285 1290 



Asn His Pro Ser Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn 
1295 1300 1305 



His Ser Phe Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn 
1310 1315 1320 



Phe Leu Pro Asn Asn Gly Arg Gin Phe 
1325 1330 



<210> 5 

<211> 885 

<212> DNA 

<213> Rattus sp 



<400> 5 

ccacgcgtcc gcacctgcaa gtgtggtccc ctgaggcaga gcgcaacatc tccctccagg 60 
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acatctgcta 


tgcccccctc 


gcctccttca 


gtacttccag 


tgaatggcca 


gacctccctg 


ctctcacgtt 


caaagatggc 


ctgtcttccc 


cttccttgct 


cgctgatcat 


aaccttctct 


ccaagctctg 


ggaggaggct 


acaagttcca 


ggttgcgttc 


ccatccagga 


cctgcctgtc 


tggccctggg 


cagctactcc 


gcctaggtgg 


ggtgatagtg 




v-» V-» L> L> L» 


c t g u y y y o-y o 




cy gda-Octy oy 


a a =a =a /TfT =3 a 

CLCLCLciy y clocl*^ 


<210> 6 




<211> 458 




<212> DNA 




<213> Rattus sp. 


<400> 6 




gaccagatgt 


taaccaagct 



ccccagagcc ttcttgccca 
acggctttgc cccagaactt 
gtgaccaaaa gttctaatgg 
aaggtcttcc ctctggttgt 
ggcattcctg ccacgaggca 
taaggcttgg tcagattact 
gggatgcatc gctgttcatt 

<210> 7 



aacccatata acaccagcct ctccgactgc tgtgtcaaca 12 0 

aacaaccgca ccctcctgat gctcacggcc aaccagactc 180 

gtggactgga aggaccattt cctctactgt gcaaatgccc 240 

acgtctctgg ccctgagctg catggctgac tacggggctc 300 

gttgggggat accaaggcac ggactattcc gaggcagaag 360 

ctcaataact accccgctga tgatccccgc atggcccagg 420 

ttcttgaagg aaatggaatc cttccagagg aacacaagtg 48 0 

tcagctgagc gctctctgga ggatgagatc aaccgcacca 540 

tttgccgtca gctacattat cgtcttcctg tacatctccc 600 

agatgcagcc gagtagcggt ggagtccaag gctactctgg 660 

tgctgggagc agttctggct tgcatggggc ttctaactcc 72 0 

tctggttatc atccaagtgg tacctttcct ggtgcttaag 780 

tacatcctag acttgagtac cagaggtacc taggaagccg 840 

attgggcgca ccctgggcat gtggc 885 



ctggtacagg aggagaaact agccagcgag gcagcagtgg 60 

cagtacccct cccctgctga tgcggatgcc aatgttaact 12 0 

gcccacggag ctaatgctgc tagaagctct ttgcccaaaa 180 

agtaggagct tgtccatgct tctgctgatg agggatcatg 240 

cctcaaggcc tggggggagg ttgttcagag aaaaatggct 300 

accggcagct tggcactgac tccttggtct cataggtccc 360 

cctcatggag agactatctt aagtatctaa gctatcgatt 42 0 

aaaaaggcta tggctatg 458 
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<211> 896 
<212> DNA 
<213> Rattus sp- 



<400> 7 

ccacgcgtcc gcagtttcat aagtacctgc cctggttcct gaatgatccg cccaatatca 60 

gatgtcccaa agggggtcta gcagcgtata gaacgtctgt gaatttgagc tcagatggcc 120 

aggttatagc ctcccagttc atggcctacc acaagccctt aaggaactca caggacttca 180 

cagaagctct ccgggcgtcc cggttgctag cagccaacat cacagctgac ctacggaagg 240 

tgcctgggac agatccaaac tttgaggtct tcccttacac gatctccaac gtgttctacc 300 

agcaatacct gacggtcctt cctgagggaa tcttcaccct tgctctttgc tttgtgccca 360 

cctttgttgt ctgctacctc ctactgggcc tggacatgtg ctcagggatc ctcaacctac 420 

tctccatcat tatgattctc gtggacacca ttggcctcat ggctgtgtgg ggtatcagct 480 

ataatgcggt atccctcatc aaccttgtca cggcagtggg catgtctgtg gagtttgtgt 540 

cccacatcac tcggtccttt gcttgtaagc accaagccta cccggctgga gagggctaaa 600 

agatgctact gtcttcatgg gcagtgcggt gtttgctgga gtggccatga ccaacttccc 660 

aggcatcctc atcttggggg ctttgcccca agcccaggct tattcagatc ttcttcttcc 720 

gcctcaacct tctgatcacc tttgctgggg tctgctgcat ggctggtctt cctgcccggt 780 

ttgtcctcag ctatctggga ccagatgtaa ccaaggctct gctacccgga ggagaaacta 840 

gccagcgagg gcagcagtgg ccccagagac ttcttgccca caagtaccct tccctg 896 

<210> 8 
<211> 3124 
<212> DNA 
<213> Rattus sp. 



<400> 8 

tgcaagtgtg gtcccctgag gcagagcgca acatctccct ccaggacatc tgctatgccc 60 

ccctcaaccc atataacacc agcctctccg actgctgtgt caacagcctc cttcagtact 120 

tccagaacaa ccgcaccctc ctgatgctca cggccaacca gactctgaat ggccagacct 180 

ccctggtgga ctggaaggac catttcctct actgtgcaaa tgcccctctc acgttcaaag 240 

atggcacgtc tctggccctg agctgcatgg ctgactacgg ggctcctgtc ttccccttcc 3 00 
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ttcrctcittQa 


gggataccaa 


ggcacggact 


attccgaggc 


agaagcgctg 


atcataacct 


360 


tctctctcaa 


taac tacccc 


gctgatgatc 


cccgcatggc 


ccaggccaag 


c t c t gggagg 


420 


aaactttctt 


gaaggaaatg 


gaatccttcc 


agaggaacac 


aagtgacaag 


ttccaggttg 


480 


cattctcagc 


tgagcgctct 


ctggaggatg 


agatcaaccg 


caccaccatc 


caggacctgc 


540 


ctcrtctttQc 


eg t c age t ac 


attatcgtct 


tcctgtacat 


ctccctggcc 


c tgggcagc t 


600 


actccacjatcT 


cagccgaata 


gcggtggagt 

53 '•^i^ 53 ^ 23 S3 S3 


ccaaggctac 


tctgggccta 


ggtggggtga 


660 


t tcrt tcrtcrc t 


gggagcaatt 


ctggctgcca 


tgggcttcta 


ctcctacctg 


ggtgt cccc t 


720 


cttctctcTcrt 


tatcatccaa 


gtggtacctt 


tcctggtgct 


agctgtggga 

^-"^ ^3 yj yj ^3 53 


gctgacaaca 


780 


tcttcatctt 


tgttct tgacr 


taccagaggc 


tacctaggat 


gcctggggaa 


cagcgagagg 

^^^53 S3 S3 S3 Si/ 


840 


ctcacattcfcr 


cccfcaccctcf 


ggcagtgtgg 


cccccagcat 


gctgctgtgc 

53 S3 S3 S3 


agcctctctg 


900 


acraccatctzcr 


cttctttcta 


ggggccctga 

53 53 53 53 53 


cccccatgcc 


age tg t gagg 

53 53 53 53 53 


accttcgcct 


960 


tcracc tc tcrcr 


cttagcaatt 


atcctcgact 


tcctgctcca 


gatgactgcc 


tttgtggccc 


1020 


terete tccct 


agatagcaag 


aggcaggagg 


cctctcgccc 


ggatgtctta 


tgctgctttt 


1080 


caacccQcraa 


gctgccccca 


cc t aaagaaa 


aagaaggcct 


cttactccgc 


ttcttccgca 


1140 


aaatatacQc 


tcctttcctg 


c t gcacagat 


tcatccgccc 


tgttgtgatg 


ctgctgtttc 


1200 


tcjaccctcjtt 


tcrgagcaaat 

53 53 ^^^^^^ 


ctctacttaa 


tgtgcaacat 


c aac gt gggg 


c tagaccagg 


1260 


acjc t orcrc t c t 


cTcccaacrgac 


tcgtacttga 


tagactactt 


cctctttctg 


aaccgatacc 


1320 


ttaaaQtcjciQ 


acctccagtg 


tactttgtca 


c c ac c t cggg 


cttcaacttc 


tec agcga ag 

^ *^S3 ^^S3 S3 53 


1380 


c acrcrc a t craa 


caccacttgc 


tctagcgcag 


gctgtaagag 


cttctcccta 


acccagaaaa 


1440 


tccacrtatcrc 


cagt gaa 1 1 c 


cctgaccagt 


cttacgtggc 


tattgc tgca 


tcctcctggg 


1500 


tacratcractt 


catc crac t gg 


ctgaccccgt 


cctcctcctg 


c tgtcgcc tt 


tatatacgtg 


1560 


gcccccataa 


ggatgagttc 


tgtccctcaa 


cggatacttc 


cttcaactgc 


t taaaaaac t 


1620 


gcat aaaccg 


cactctgggt 


cc tg tgaggc 


ccacagcgga 


acagtttcat 


aagt acc t gc 


1680 


cctggttcct 


gaat gat ccg 


cccaatatca 


gatgtcccaa 


aggggg t c t a 


gcagcgtata 


1740 


gaacgtctgt 


gaat tt gage 


tcagatggcc 


aggttatagc 


ctcccagttc 


atggcctacc 


1800 


acaagccctt 


aagaaactca 


caggacttca 


c agaagc t c t 


ccgggcgtcc 

S3 vj* S? S3 


cggttgctag 

S3 «mI S3 S3 


1860 


cacrccaacat 


cacagctgac 


c t acggaagg 

53 53 53 S3 


tgcctgggac 

53 S3 S3 S3 ~ 


agatccaaac 


1 1 1 gagg t c t 


1920 


tcccttacac 


gatctccaac 


gtgttctacc 


agcaatacct 


gacggtcctt 


cctgagggaa 


1980 


tcttcaccct 


tgctctttgc 


tttgtgccca 


cctttgttgt 


ctgctacctc 


ctactgggcc 


2040 


tggacatgtg 


ctcagggatc 


ctcaacctac 


tctccatcat 


tatgattctc 


gtggacacca 


2100 


ttggcctcat 


ggctgtgtgg 


ggtatcagct 


ataatgcggt 


atccctcatc 


aaccttgtca 


2160 
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cggcagtggg 


catgtctgtg 


gagtttgtgt 


cccacatcac 


tcggtccttt 


gctgtaagca 


2220 


ccaagcctac 


ccggctggag 


agggctaaag 


atgctactgt 


cttcatgggc 


agtgcggtgt 


2280 


ttgctggagt 


ggccatgacc 


aacttcccag 


gcatcctcat 


cttgggcttt 


gcccaagccc 


2340 


agcttattca 


gatcttcttc 


ttccgcctca 


accttctgat 


caccttgctg 


ggtctgctgc 


2400 


atggcctggt 


cttcctgccg 


gttgtcctca 


gctatctggg 


accagatgtt 


aaccaagctc 


2460 


tggtacagga 


ggagaaacta 


gccagcgagg 


cagcagtggc 


cccagagcct 


tcttgcccac 


2520 


agtacccctc 


ccctgctgat 


gcggatgcca 


atgttaacta 


cggctttgcc 


ccagaacttg 


2580 


cccacggagc 


taatgctgct 


agaagctctt 


tgcccaaaag 


tgaccaaaag 


ttctaatgga 


2640 


gtaggagctt 


gtccatgctt 


cttgctgatg 


agggatcatg 


aaggtcttcc 


ctctggttgt 


2700 


cctcaaggcc 


tggggggagg 


ttgtttcaga 


gaaaaatggc 


tggcattcct 


gccacgaggc 


2760 


aaccggcagc 


attggcactg 


acctccttgc 


tctcataggt 


ccctaaggcc 


ttggtcagat 


2820 


tacctcctcc 


atggagagac 


tatcttaagt 


atcttaagta 


tcgtatggga 


tgcatcgcct 


2880 


gtcaattaaa 


aaggctatgg 


cctatggctc 


aggcagggcc 


atccggaaga 


agagaggatt 


2940 


ctgggataaa 


gccaggtggg 


agattcgcct 


ggggaaaatg 


tgacaatggt 


tcctgagcat 


3000 


gggcaatcag 


ccatgtggca 


gaatgtaaat 


taatataaat 


gggttgtctt 


aagttatgat 


3060 


tctagctggg 


gaggagccta 


gctgtgtagc 


caagatattt 


gtaaatataa 


aaaaaaaaaa 


3120 


aaaa 












3124 



<210> 9 

<211> 4484 

<212> DNA 

<213> Rattus sp. 



<400> 9 

atggcagctg cctggctggg atggctgctc tgggccctgc tcctgagcgc ggcccagggt 



60 



gagctataca cacccaaaca cgaagctggg gtctgcacct tttacgaaga gtgcgggaaa 



120 



aacccagagc tctctggagg cctcacgtca ctatccaatg tatcctgcct gtctaacacc 



180 



ccggcccgcc acgtcacggg tgaacacctg gctcttctcc agcgcatctg tccccgcctg 



240 



tacaacggcc ccaataccac ttttgcctgt tgctctacca agcagctgct gtccttagaa 



300 



agcagcatgt ccatcaccaa ggcccttctc acgcgctgcc cggcctgctc tgacaatttt 



360 



gtgagcttac actgccacaa cacttgcagc cctgaccaga gcctcttcat caacgtcacc 



420 
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caaatggtta 


agcgacracgc 

s»^5353>353^53^ 


tggagagcct 


cctcrccgtgg 

V** 53 S3 S3 


fcggcctatga 


ggccttttat 


480 


caaccjc age t 


ttgctgagaa 


ggcc tat gag 


tcctgcagcc 


aggtgcgcat 


ccctgcggcc 


540 


acttccttag 

2J \^ 


ccgtaggcag 


catgtgtgga 


gtgtatggct 

S7 53 S3 S3 


ccgccctctg 


caatgctcag 


600 


cac tggc t ca 


act tccaaga 


agacacaggg 


aatggcc tgg 

•^^^ w X3 ^3 \«» x.^ «M» 53 53 


ctccgctgga 


tatcaccttc 


660 


cacctcttgg 


agcctggcca 


ggccc taccg 


gatgggatcc 


agccactgaa 


tgggaagatc 


720 


CTcaccctgca 


acgaatctca 


ggg t ga t gac 


tcagcagtct 


gctcctgcca 


aaactgtgcg 

S3 ^3 ^ S^ ^ S3 S^ 


780 


cjcQtcctgcc 


ctcitcatccc 


tccgcccgag 


gccttgcgcc 


ct tccttcta 


catgggtcgc 

Vii^ w 53 53 53 w 53 


840 


atcrccaggct 


cTcrc t crcrc cc t 

53 53 53 53 ^^^-^ w» 


catcatcatc 


ttcactgctg 


tctttgtatt 


gctctctgca 


900 


CTtCCttQtCTC 


crtctcccracrt 


ggtttccaac 


aggaacaaga 


ac aaggcaga 


acr Qcccccag 


960 


craaCTCCCCca 


aactccctca 


taagcacaaa 


ctctcacccc 


ataccatcct 


QCfcrc ccrcr 1 1 c 

53 53 53 V— 53 53 Vi" 


1020 


ttccacraact 


cjcrcrcrc a c aacr 


ggt ggcc teg 

5353 53 53 Vii* ^Xn*53 


tggccactca 


ccgtcttagc 


actcrtccttc 


1080 


atccrt tcrtcra 


tacrcct tacrc 


acrcacTCTcctg 

53 53 53 w 53 


acct t tattg 


aac tcaccac 


acraccctcrtcr 


1140 


aaactgtggt 

53 53 53 53 %^ 


cggcccccaa 


gage c aggee 


cggaaagaga 


agtctttcca 


tgatgagcat 

x^ 53 X* %^ 53 x^ 53 


1200 


ttcggcccct 

yj 23 


tctttccraac 


caaccagatt 


ttcgtgacag 


ctcggaacag 


gtccagctac 


1260 


aagtacgact 


ccc tact get 


agggtccaag 


aacttcagtg 


ggatcctgtc 

53 53 \^ \^ 53 \^ 


cctggacttc 

x^ 53 S3 ^^^^^ 


1320 


c tCTc t ggagc 

^^53 %^53 53 ^^53 


t gc t ggagc t 

53 %^ 53 53 x^ 53 


teaggagagg 


cttcgacacc 


tgcaagtgtg 

^ 53 ^-*^**"53 ^53 ^^3 


gtcccctgag 

53 \^ 53 53 


1380 


CTcaQacrccjca 


acat c tccct 


ccaQgacatc 


tgctatcrccc 


CCC t caaccc 


atataacacc 


1440 


acrcctctcccT 


actQctcrtcrt 


caacagcetc 


cttcacrtact 


tccacraacaa 


ccoca^cicicifLCi 

53 ^^^^ ^ 


1500 

•X> \J \J 


ctgatCTctca 


CCTCTccaacca 

^^5353 ^^^^ 


gaetc tgaat 


ggccagacct 

53 53 ^^^S3 ^^^^ 


CCC t crcrtcrcra 


c t crcraacrcrac 


1560 


catt tcctct 


actgtgcaaa 


tgcecctcte 


accrt t caaag 


a t crcrc accrt c 

53 53 vA^^ 53 


tc tcrcrccctcr 

x^ 53 23 x^^^ ^ 53 


1620 


aCTctgcatgg 


ctgactacgg 


ggctcctgte 

53 53 \^ ^ 53 


ttccccttcc 


1 1 crc t cr 1 1 crcr 


crcrcrataccaa 


1680 


ggcacgaact 

53 53 w 53 53 


attccgaggc 


agaagcgctg 


atcataacct 


tctctctcaa 


taactacccc 


1740 


gctgatgatc 


cccgcatggc 


ccaggccaag 


etc t gggagg 

\^ 53 53 53 ^^53 53 


aggctttctt 


gaaggaaatg 

53 XAVA 53 53 53 


1800 


gaatccttcc 


agaggaacac 


aag t gacaag 


ttccaggttg 

*^ 53 S3 S3 


cgttctcagc 


tgagcgctct 


1860 


ctggagaatg 

w WF 53 53 VA53 53 53 


agatcaacca 


caccaccatc 


caggacctgc 


ctgtctttgc 


cgtcagctac 

X^ 53 X^ X^ 


1920 


attatcCTtct 


tec tgt acat 


ctccctggcc 


e t crcrcrcacrc t 


actccaCTatcT 


c acre CQacrt a 

53 53 53 ^ 


1980 


gcgataaaat 


ccaaggctac 


tctgggccta 


ggtggggtga 


ttgttgtgct 


gggagcagt t 

23 23 23 ***"23 ^ 


2040 


ctggctgcca 


tgggcttcta 


ctcctacctg 


ggtgtcccct 


cttctctggt 


tatcatecaa 


2100 


gtggtacctt 


tcctggtgct 


agctgtggga 


getgacaaca 


tcttcatctt 


tgttettgag 


2160 


taccagaggc 


tacctaggat 


gcctggggaa 


cagegagagg 


ctcacattgg 


ccgeaccetg 


2220 


ggcagtgtgg 


cccccagcat 


gctgctgtgc 


agectctctg 


aggccatctg 


ettctttcta 


2280 
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ggggccctga 


cccccatgcc 


agctgtgagg 


accttcgcct 


tgacctctgg 


cttagcaatt 


2340 


atcctcgact 


tcctgctcca 


gatgactgcc 


tttgtggccc 


tgctctccct 


ggatagcaag 


2400 


aggcaggagg 


cctctcgccc 


ggatgtctta 


tgctgctttt 


caacccggaa 


gctgccccca 


2460 


cctaaagaaa 


aagaaggcct 


cttactccgc 


ttcttccgca 


agatatacgc 


tcctttcctg 


2520 


ctgcacagat 


tcatccgccc 


tgttgtgatg 


ctgctgtttc 


tgaccctgtt 


tggagcaaat 


2580 


ctctacttaa 


tgtgcaacat 


caacgtgggg 


ctagaccagg 


agctggctct 


gcccaaggac 


2640 


tcgtacttga 


tagactactt 


cctctttctg 


aaccgatacc 


ttgaagtggg 


gcctccagtg 


2700 


tactttgtca 


ccacctcggg 


cttcaacttc 


tccagcgagg 


caggcatgaa 


cgccacttgc 


2760 


tctagcgcag 


gctgtaagag 


cttctcccta 


acccagaaaa 


tccagtatgc 


cagtgaattc 


2820 


cctgaccagt 


cttacgtggc 


tattgctgca 


tcctcctggg 


tagatgactt 


catcgactgg 


2880 


ctgaccccgt 


cctcctcctg 


ctgtcgcctt 


tatatacgtg 


gcccccataa 


ggatgagttc 


2940 


tgtccctcaa 


cggatacttc 


cttcaactgc 


ttaaaaaact 


gcatgaaccg 


cactctgggt 


3000 


cctgtgaggc 


ccacagcgga 


acagtttcat 


aagtacctgc 


cctggttcct 


gaatgatccg 


3060 


cccaatatca 


gatgtcccaa 


agggggtcta 


gcagcgtata 


gaacgtctgt 


gaatttgagc 


3120 


tcagatggcc 


aggttatagc 


ctcccagttc 


atggcctacc 


acaagccctt 


aaggaactca 


3180 


caggacttca 


cagaagctct 


ccgggcgtcc 


cggttgctag 


cagccaacat 


cacagctgac 


3240 


ctacggaagg 


tgcctgggac 


agatccaaac 


tttgaggtct 


tcccttacac 


gatctccaac 


3300 


gtgttctacc 


agcaatacct 


gacggtcctt 


cctgagggaa 


tcttcaccct 


tgctctttgc 


3360 


tttgtgccca 


cctttgttgt 


ctgctacctc 


ctactgggcc 


tggacatgtg 


ctcagggatc 


3420 


ctcaacctac 


tctccatcat 


tatgattctc 


gtggacacca 


ttggcctcat 


ggctgtgtgg 


3480 


ggtatcagct 


ataatgcggt 


atccctcatc 


aaccttgtca 


cggcagtggg 


catgtctgtg 


3540 


gagtttgtgt 


cccacatcac 


tcggtccttt 


gctgtaagca 


ccaagcctac 


ccggctggag 


3600 


agggctaaag 


atgctactgt 


cttcatgggc 


agtgcggtgt 


ttgctggagt 


ggccatgacc 


3660 


aacttcccag 


gcatcctcat 


cttgggcttt 


gcccaagccc 


agcttattca 


gatcttcttc 


3720 


ttccgcctca 


accttctgat 


caccttgctg 


ggtctgctgc 


atggcctggt 


cttcctgccg 


3780 


gttgtcctca 


gctatctggg 


accagatgtt 


aaccaagctc 


tggtacagga 


ggagaaacta 


3840 


gccagcgagg 


cagcagtggc 


cccagagcct 


tcttgcccac 


agtacccctc 


ccctgctgat 


3900 


gcggatgcca 


atgttaacta 


cggctttgcc 


ccagaacttg 


cccacggagc 


taatgctgct 


3960 


agaagctctt 


tgcccaaaag 


tgaccaaaag 


ttctaatgga 


gtaggagctt 


gtccatgctt 


4020 


cttgctgatg 


agggatcatg 


aaggtcttcc 


ctctggttgt 


cctcaaggcc 


tggggggagg 


4080 
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ttgtttcaga 


gaaaaatggc 


tggcattcct 


gccacgaggc 


aaccggcagc 


attggcactg 


4140 


acctccttgc 


tctcataggt 


ccctaaggcc 


ttggtcagat 


tacctcctcc 


atggagagac 


4200 


tatcttaagt 


atcttaagta 


tcgtatggga 


tgcatcgcct 


gtcaattaaa 


aaggctatgg 


4260 


cctatggctc 


aggcagggcc 


atccggaaga 


agagaggatt 


ctgggataaa 


gccaggtggg 


4320 


agattcgcct 


ggggaaaatg 


tgacaatggt 


tcctgagcat 


gggcaatcag 


ccatgtggca 


4380 


gaatgtaaat 


taatataaat 


gggttgtctt 


aagttatgat 


tctagctggg 


gaggagccta 


4440 


gctgtgtagc 


caagatattt 


gtaaatataa 


aaaaaaaaaa 


aaaa 




4484 



<210> 10 



<211> 
<212> 



3993 
DNA 



<213> Rattus sp 



<400> 10 

atggcngcng cntggytngg ntggytnytn tgggcnytny tnytnwsngc ngcncarggn 
gairytntaya cnccnaarca ygargcnggn gtntgyacnt tytaygarga rtgyggnaar 
aayccngary tnwsnggngg nytnacnwsn ytnwsnaayg tnwsntgyyt nwsnaayacn 
ccngcnmgnc aygtnacngg ngarcayytn gcnytnytnc armgnathtg yccnmgnytn 
tayaayggnc cnaayacnac nttygcntgy tgywsnacna arcarytnyt nwsnytngar 
wsnwsnatgw snathacnaa rgcnytnytn acnmgntgyc cngcntgyws ngayaaytty 
gtnwsnytnc aytgycayaa yacntgywsn ccngaycarw snytnttyat haaygtnacn 
mgngtngtng armgnggngc nggngarccn ccngcngtng tngcntayga rgcnttytay 
carmgnwsnt tygcngaraa rgcntaygar wsntgywsnc argtnmgnat hccngcngcn 
gcnwsnytng cngtnggnws natgtgyggn gtntayggnw sngcnytntg yaaygcncar 
mgntggytna ayttycargg ngayacnggn aayggnytng cnccnytnga yathacntty 
cayytnytng arccnggnca rgcnytnccn gayggnathc arccnytnaa yggnaarath 
gcnccntgya aygarwsnca rggngaygay wsngcngtnt gywsntgyca rgaytgygcn 
gcnwsntgyc cngtnathcc nccnccngar gcnytnmgnc cnwsnttyta yatgggnmgn 
atgccnggnt ggytngcnyt nathathath ttyacngcng tnttygtnyt nytnwsngcn 
gtnytngtnm gnytnmgngt ngtnwsnaay mgnaayaara ayaargcnga rggnccncar 
gargcnccna arytnccnca yaarcayaar ytnwsnccnc ayacnathyt nggnmgntty 
ttycaraayt ggggnacnmg ngtngcnwsn tggccnytna cngtnytngc nytnwsntty 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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atha tngtna 


ttiacnvtnac 


n Q cngqxiv t n 


acn t tva th.g 


cjarv t n t Q aw 


siiQcnccnaa 


rwsncaracn 


mcrnaarcrair a 


t tvaanc cnt 


ty t tymgnac 


naavcarath: 


t tvcrtnacng 


aaxtaVQ^ayw 


snvtnvtnvt 


ngcmwsnaax 


aav t tvwsng 


vtnvtnga3ry 


tnytnaarvt 


ncaargarmcm 


v t mticmc aw 


acncraiTinciTia 


a va tliws nv t 


ncarcravatli 


t crvt avcjcn c 


w<=; nv tnws ncf 


av t crvt crvcT t 


naavwsnvlri 


vtncairtavt 


vtnatcrvtna 


cncrcnaavca 


racnvtnaav 


cTcmcairacnw 


cavt twtnt 


av t Qvcrcnaa 


vcrcn c cnv t n 


acnttvaaircr 

(m«4.Nm^XX *-^wi^Ji* 53 


wciTi 1" crva t" crcr 


cncrav t avcrcr 


ncrcnc cncT t" n 


i* t vccnt" t w 


rrrrn a n cr a V 1" 


avwsncraircrc 


no'ai'crcnvtn 

XX VI K^Jm V^XX_]r V«XX 


athathacnt 

^XXwL l^XxwLV^XX \^ 


cfcncravcravc 


ciunona t crcrc 


nc aircrcnaair 


vtntcTcra'aircf 

J *-*xx ^53 53 53 53 


crarwsnttvc 


airnicnnaavac 


nwsncravaair 


t tvc aarcrtncr 


V t n CTaiTCT aver 


a ira t ha a vincr 


nacnacnath. 


caiTcrawtnc 


atliatlicrtnt 


twtntavat 


liwsnvtncfcn 


vtncjcmwsnt 


cTcncrtncrairw 


snaaiTCTcnac 


n V t ncrcmv tn 

xxjr ^xx>^ vjxx y 


crcmcrcincrtna 


vtncTcncrcna 


tcTCTciTit tvta 


vwsntawtn 


CTCfncrtnccnw 


crtncrtnccnt 


twtncftnvt 


n crcncj triCTcan 


crcnCTavaava 

53 %^xx>-| wx y v^^^ y 


t avc airmcrnv 


tncciuncmat 


crccncrancrair 


cairmcmcraircr 

^^^lAx^M Ahb53 AA53 ^^^aA^ 53 


CTcrnwsncr t riQ 


cnccnwsnat 


crv t nv tn t cpy 


wsnvtnwsncT 

w *^xx_y Vri^xxw *^AA53 


crcmcrcnvtna 

yyxxyv^xxjr ^^xxt* 


cnccna t ctcc 


nCTcncrtnincm 


acn 1 1 vcrcn v 


athvtncravt 


twtnvtnca 


xa t cracncrcn 


t tvcr t ncrcn V 




cnwsnmcrncc 


ncravcrtnvtn 


tcrvtcrvttvw 

**53_]f **53Jl ^ 


c cnaaiTciaira 


argarggnyt 


nytnytnmgn 


1 1 V 1 1 vxticf n a 


V t n c avman t 


ty a t tungnc c 


natncrtnata 


vtnvtnttw 


vtnta'wtna 


tcrtovaavat 


liaavQtnQQn 


V t ncravc aiTCT 


wsntayytna 


thgaytaytt 


yytnttyytn 


aaymgn t ayy 


tayttygtna 


cnacnwsngg 


nttyaaytty 


wsnwsngarg 


wsnwsngcng 


gntgyaarws 


nttywsnytn 


acncaraara 


ccngaycarw 


sntaygtngc 


nathgcngcn 


wsnwsntggg 
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arytnacnac 


ngayccngtn 


1140 


aiiTtfsnttyca 


vgavcrax c ay 


1200 


cnmgnaaymg 


nwsnwsn t ay 


1260 


gnatliytnws 


nv tncjav t ty 


1320 


tncargtntg 


gwsnccngaz* 


1380 


cnytnaaycc 


nt avaavacn 


1440 


tvcaraavaa 


vmcmacnvtn 


1500 


snvtnatnga 


vtcTcraaarcrav 

JT ^53 53 53 


1560 


avcrcmacnws 


nv tncrcn V t n 


1620 


t ncTcncT t ncrcr 

*»**53 ^>^**53 *»**sJ 53 


ncTcmtavcair 

XX53 WJ XX \m^\^mMm 


1680 


tvwsnvtnaa 


vaav t av c cn 


1740 


aiTCf cn t twt 


naaiTcraira t cf 


1800 


cn 1 1 vwsn crc 


nerajnuuiiw sn 


1860 


cncrtnttvcTC 


ncrtnwsntav 


1920 


avwsnmcfn t ct 


vwsniQCf ncT tn 


1980 


thcrtncrtnvt 


ncTcmcrcn cr t n 

^^53 534A»ifc53 %a^AA53 ^^^^ 


2040 


snwsnvtnQt 


nathathcair 


2100 


thttvathtt 


vcr tnv tn crair 


2160 


cncava tlicrcr 


nmgnacnytn 


2220 


arcTcna t li t cr 


V 1 1 V 1 1 w tn 


2280 


tnacnwsncTcr 

^M^Aali WW Mi'*A23 


nv tncrcn a t h 


2340 


t ny t nws ny t 


ngavwsnaar 


2400 


snacnmcinaa 


rvtnccnccn 


2460 


arathtaygc 


nccnt tyy tn 


2520 


tnacnv t n 1 1 


vcrcingcnaav 

jr ^ 23^^53 ^^^'•^^.JT 


2580 


aiTv t ncrcnv t 


nccnaaiTcrav 


2640 


tngargtngg 


nccnccngtn 


2700 


cnggnatgaa 


ygcnacntgy 


2760 


the art ay gc 


nwsngartty 


2820 


tngaygaytt 


yathgaytgg 


2880 
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ytnacnccnw snwsnwsntg ytgymgnytn tayathmgng gnc'cncayaa rgaygartty 2940 

tgyccnwsna cngayacnws nttyaaytgy ytnaaraayt gyatgaaymg nacnytnggn 3000 

ccngtiungnc cnacngcnga rcarttycay aartayytnc cntggttyyt naaygayccn 3060 

ccnaayathm gntgyccnaa rggnggnytn gcngcntaym gnacnwsngt naayytnwsn 3120 

wsngayggnc argtnathgc nwsncartty atggcntayc ayaarccnyt nmgnaaywsn 3180 

cargayttya cngargcnyt nmgngcnwsn mgnytnytng cngcnaayat hacngcngay 3240 

ytnmgnaarg tnccnggnac ngayccnaay ttygargtnt tyccntayac nathwsnaay 3300 

gtnttytayc arcartayyt nacngtnytn ccngarggna thttyacnyt ngcnytntgy 3360 

ttygtnccna cnttygtngt ntgytayytn ytnytnggny tngayatgtg ywsnggnath 3420 

ytnaayytny tnwsnathat hatgathytn gtngayacna thggnytnat ggcngtntgg 3480 

ggnathwsnt ayaaygcngt nwsnytnath aayytngtna cngcngtngg natgwsngtn 3540 

garttygtnw sncayathac nmgnwsntty gcngtnwsna cnaarccnac nmgnytngar 3600 

mgngcnaarg aygcnacngt nttyatgggn wsngcngtnt tygcnggngt ngcnatgacn 3660 

aayttyccng gnathytnat hytnggntty gcncargcnc arytnathca rathttytty 3720 

ttymgnytna ayytnytnat hacnytnytn ggnytnytnc ayggnytngt nttyytnccn 3780 

gtngtnytnw sntayytngg nccngaygtn aaycargcny tngtncarga rgaraarytn 3840 

gcnwsngarg cngcngtngc nccngarccn wsntgyccnc artayccnws nccngcngay 3900 

gcngaygcna aygtnaayta yggnttygcn ccngarytng cncayggngc naaygcngcn 3960 

mgnwsnwsny tnccnaarws ngaycaraar tty 3993 

<210> 11 
<211> 4002 
<212> DNA 
<213> Mus sp. 



<220> 

<221> CDS 

<222> (1) . . (4002) 

<223> 



<400> 11 

atg gca get gcc tgg cag gga tgg ctg etc tgg gcc ctg etc etg aat 



48 
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Met Ala Ala Ala Trp Gin Gly Trp Leu Leu Trp Ala Leu Leu Leu Asn 
15 10 15 

teg gcc cag ggt gag etc tac aca ccc act cac aaa get ggc ttc tgc 96 
Ser Ala Gin Gly Glu Leu Tyr Thr Pro Thr His Lys Ala Gly Phe Cys 

20 25 30 

acc ttt tat gaa gag tgt ggg aag aac oca gag ctt tct gga ggc etc 144 
Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 40 45 

aca tea eta tec aat ate tee tgc ttg tct aat ace eca gee cge cat 192 
Thr Ser Leu Ser Asn lie Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 

gtc aca ggt gae cac ctg get ctt etc cag cge gtc tgt ccc cge eta 240 
Val Thr Gly Asp His Leu Ala Leu Leu Gin Arg Val Cys Pro Arg Leu 
65 70 75 80 

tac aat ggc ccc aat gac acc tat gcc tgt tgc tct acc aag cag ctg 288 
Tyr Asn Gly Pro Asn Asp Thr Tyr Ala Cys Cys Ser Thr Lys Gin Leu 

85 90 95 

gtg tea tta gac agt age ctg tct ate ace aag gee etc ctt aca cge 33 6 

Val Ser Leu Asp Ser Ser Leu Ser lie Thr Lys Ala Leu Leu Thr Arg 

100 105 110 

tgc ecg gea tgc tct gaa aat ttt gtg age ata cac tgt eat aat ace 384 
Cys Pro Ala Cys Ser Glu Asn Phe Val Ser lie His Cys His Asn Thr 
115 120 125 

tgc age cet gae cag age etc ttc ate aat gtt act cge gtg gtt cag 432 
Cys Ser Pro Asp Gin Ser Leu Phe lie Asn Val Thr Arg Val Val Gin 
130 135 140 

egg gac cct gga cag ctt cet get gtg gtg gcc tat gag gcc ttt tat 480 
Arg Asp Pro Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 

caa cge agt ttt gea gag aag gcc tat gag tec tgt age egg gtg cge 528 
Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Arg Val Arg 

165 170 175 

ate cet gea get gee teg ctg get gtg ggc age atg tgt gga gtg tat 576 
lie Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 

180 185 190 

ggc tct gee etc tgc aat get cag cge tgg etc aac ttc caa gga gac 624 
Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 

aca ggg aat ggc ctg get ecg ctg gac ate acc ttc cac etc ttg gag 672 
Thr Gly Asn Gly Leu Ala Pro Leu Asp lie Thr Phe His Leu Leu Glu 
210 215 220 

cct ggc cag gcc ctg gea gat ggg atg aag eca ctg gat ggg aag ate 720 
Pro Gly Gin Ala Leu Ala Asp Gly Met Lys Pro Leu Asp Gly Lys He 
225 230 235 240 



aca ccc tgc aat gag tee cag ggt gaa gae teg gea gee tgt tee tgc 
Thr Pro Cys Asn Glu Ser Gin Gly Glu Asp Ser Ala Ala Cys Ser Cys 



768 
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245 250 255 

cag gac tgt gca gca tec tgc cct gtc ate cct ccg ccc ccg gcc ctg 816 
Gin Asp Cys Ala Ala Ser Cys Pro Val He Pro Pro Pro Pro Ala Leu 

260 265 270 

cgc cct tct ttc tac atg ggt cga atg cca ggc tgg ctg get etc ate 8 64 

Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu He 
275 280 285 

ate ate ttc act get gtc ttt gta ttg etc tct gtt gtc ctt gtg tat 912 
He He Phe Thr Ala Val Phe Val Leu Leu Ser Val Val Leu Val Tyr 
290 295 300 

etc cga gtg get tec aac agg aac aag aac aag aca gca ggc tec cag 960 
Leu Arg Val Ala Ser Asn Arg Asn Lys Asn Lys Thr Ala Gly Ser Gin 
305 310 315 320 

gaa gee ccc aac etc cct cgt aag cgc aga ttc tea cct eac act gtc 1008 
Glu Ala Pro Asn Leu Pro Arg Lys Arg Arg Phe Ser Pro His Thr Val 

325 330 335 

ctt ggc egg ttc ttc gag age tgg gga aca agg gtg gcc tea tgg cea 1056 
Leu Gly Arg Phe Phe Glu Ser Trp Gly Thr Arg Val Ala Ser Trp Pro 

340 345 350 

etc act gtc ttg gca ctg tec ttc ata gtt gtg ata gcc ttg tea gta 1104 
Leu Thr Val Leu Ala Leu Ser Phe He Val Val He Ala Leu Ser Val 
355 360 365 

ggc ctg ace ttt ata gaa etc ace aca gac cct gtg gaa ctg tgg teg 1152 
Gly Leu Thr Phe He Glu Leu Thr Thr Asp Pro Val Glu Leu Tarp Ser 
370 375 380 

gee cct aaa age caa gee egg aaa gaa aag get ttc cat gac gag cat 1200 
Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ala Phe His Asp Glu His 
385 390 395 400 

ttt ggc ccc ttc ttc cga acc aac cag att ttt gtg aca get aag aac 1248 
Phe Gly Pro Phe Phe Arg Thr Asn Gin He Phe Val Thr Ala Lys Asn 

405 410 415 

agg tee age tac aag tac gac tec ctg ctg eta ggg ccc aag aac ttc 1296 
Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 

420 425 430 

agt ggg ate eta tee ctg gac ttg ctg cag gag ctg ttg gag eta cag 1344 
Ser Gly He Leu Ser Leu Asp Leu Leu Gin Glu Leu Leu Glu Leu Gin 
435 440 445 

gag aga ctt cga eac ctg caa gtg tgg tee eat gag gca cag cgc aac 13 92 

Glu Arg Leu Arg His Leu Gin Val Trp Ser His Glu Ala Gin Arg Asn 
450 455 460 

ate tec etc cag gac ate tgc tat get ccc etc aac ccg cat aac ace 1440 
He Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro His Asn Thr 
465 470 475 480 



age etc act gac tgc tgt gtc aac age etc ctt caa tac ttc cag aac 

Ser Leu Thr Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 490 495 



1488 
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aac cac aca 
Asn His Thr 



acc tec ctg 
Thr Ser Leu 
515 

cct etc aeg 

Pro Leu Thr 
53 0 

gae tac ggg 

Asp Tyr Gly 
545 

ggg aeg gae 

Gly Thr Asp 



aat aae tae 
Asn Asn Tyr 



gag gag get 
Glu Glu Ala 
595 

gae aag tte 
Asp Lys Phe 
610 

ate aat ege 
lie Asn Arg 
625 

ett ate gtc 
Leu lie Val 



tgg age ega 
Trp Ser Arg 



gtg get gtt 
Val Ala Val 
675 

tae etg ggt 
Tyr Leu Gly 
690 

ctg gtg etg 
Leu Val Leu 
705 

tae eag agg 

Tyr Gin Arg 



etc ctg ctg 
Leu Leu Leu 
500 

gtg gae tgg 
Val Asp Trp 



tac aaa gat 

Tyr Lys Asp 



gca cct gtc 
Ala Pro Val 
550 

tac teg gag 
Tyr Ser Glu 
565 

cec get gat 
Pro Ala Asp 
580 

ttc ttg aag 
Phe Leu Lys 



cag att gcg 
Gin lie Ala 



act ace ate 
Thr Thr lie 
630 

ttc ctg tac 
Phe Leu Tyr 
645 

gtt gcg gtg 
Val Ala Val 
660 

gtg ctg gga 
Val Leu Gly 



gtc cec tee 
Val Pro Ser 



get gtg gga 
Ala Val Gly 
710 

ctg cct agg 
Leu Pro Arg 
725 



etc aca gee 
Leu Thr Ala 
505 

aag gae cat 
Lys Asp His 
520 

ggc aca gee 
Gly Thr Ala 
535 

ttc cec ttc 
Phe Pro Phe 



gca gaa gee 
Ala Glu Ala 



gat cec ege 
Asp Pro Arg 
585 

gaa atg caa 
Glu Met Gin 
600 

ttc tea get 
Phe Ser Ala 
615 

eag gae etg 
Gin Asp Leu 



ate tec ctg 
lie Ser Leu 



gat tec aag 
Asp Ser Lys 
665 

gca gtc gtc 
Ala Val Val 
680 

tet ctg gtc 
Ser Leu Val 
695 

get gae aac 
Ala Asp Asn 



atg cec ggg 
Met Pro Gly 



aat cag act 
Asn Gin Thr 



ttc etc tac 
Phe Leu Tyr 



ctg gee ctg 
Leu Ala Leu 
540 

ett get gtt 
Leu Ala Val 
555 

etg ate ata 
Leu lie lie 
570 

atg gee cac 
Met Ala His 



tec ttc cag 
Ser Phe Gin 



gag cgt tet 
Glu Arg Ser 
620 

cct gtc ttt 
Pro Val Phe 
635 

gee ctg ggc 
Ala Leu Gly 
650 

get act ctg 
Ala Thr Leu 



get gee atg 
Ala Ala Met 



ate att caa 
lie He Gin 
700 

ate tte ate 
He Phe He 
715 

gag cag cga 
Glu Gin Arg 
730 



ctg aat ggc 
Leu Asn Gly 
510 

tgt gee aat 
Cys Ala Asn 
525 

age tgc ata 
Ser Cys He 



ggg ggc tac 

Gly Gly Tyr 



ace tte tet 
Thr Phe Ser 
575 

gee aag etc 
Ala Lys Leu 
590 

aga age aca 
Arg Ser Thr 
605 

ctg gag gae 
Leu Glu Asp 



gee ate age 
Ala He Ser 



age tae tec 
Ser Tyr Ser 
655 

ggc eta ggt 
Gly Leu Gly 

670 

ggc ttc tac 
Gly Phe Tyr 
685 

gtg gta cct 
Val Val Pro 



ttt gtt ett 
Phe Val Leu 



gag get cac 
Glu Ala His 

735 



cag 1536 
Gin 



gee 1584 

Ala 



get 1632 
Ala 



caa 1680 

Gin 

560 

ate 1728 
He 



tgg 1776 
Trp 



get 1824 
Ala 



gag 1872 
Glu 



tac 1920 

Tyr 

640 

aga 1968 
Arg 



ggg 2016 
Gly 



tec 2064 
Ser 



tte 2112 
Phe 



gag 2160 

Glu 

720 

att 2208 
He 
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ggc cgc acc ctg ggt agt gtg gcc ccc age atg ctg ctg tgc age etc 2256 
Gly Arg Thr Leu Gly Ser Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 745 750 

tct gag gcc ate tgc ttc ttt eta ggg gee ctg ace tec atg cea get 2304 
Ser Glu Ala lie Cys Phe Phe Leu Gly Ala Leu Thr Ser Met Pro Ala 
755 760 765 

gtg agg acc ttt gee ttg acc tct ggc tta gea ate ate ttt gae ttc 2352 
Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala He He Phe Asp Phe 
770 775 780 

ctg etc eag atg aca gcc ttt gtg gcc ctg etc tec ctg gat age aag 2400 
Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 

agg cag gag gcc tct cgc ccc gac gtc gtg tgc tgc ttt tea age ega 2448 
Arg Gin Glu Ala Ser Arg Pro Asp Val Val Cys Cys Phe Ser Ser Arg 

805 810 815 

aat ctg ccc cea ccg aaa caa aaa gaa ggc etc tta ett tgc ttc ttc 2496 
Asn Leu Pro Pro Pro Lys Gin Lys Glu Gly Leu Leu Leu Cys Phe Phe 

820 825 830 

cgc aag ata tac act ccc ttc ctg ctg cac aga ttc ate cgc cet gtt 2544 
Arg Lys He Tyr Thr Pro Phe Leu Leu His Arg Phe He Arg Pro Val 
835 840 845 

gtg ctg ctg etc ttt ctg gtc ctg ttt gga gea aac etc tac tta atg 2592 
Val Leu Leu Leu Phe Leu Val Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 

tgc aac ate age gtg ggg ctg gac cag gat ctg get etg eec aag gat 2 640 

Cys Asn He Ser Val Gly Leu Asp Gin Asp Leu Ala Leu Pro Lys Asp 
865 870 875 880 

tec tac ctg ata gac tac ttc etc ttt etg aac egg tac ttg gaa gtg 2688 
Ser Tyr Leu He Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 

885 890 895 

ggg cet cea gtg tac ttt gac ace ace tea ggc tac aac ttt tec acc 2736 
Gly Pro Pro Val Tyr Phe Asp Thr Thr Ser Gly Tyr Asn Phe Ser Thr 

900 905 910 

gag gea ggc atg aac gcc att tgc tct agt gea ggc tgt gag age ttc 2 784 

Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Glu Ser Phe 
915 920 925 

tec eta acc cag aaa ate cag tat gcc agt gaa ttc cet aat cag tct 2832 
Ser Leu Thr Gin Lys He Gin Tyr Ala Ser Glu Phe Pro Asn Gin Ser 
930 935 940 

tat gtg get att get gea tec tee tgg gta gat gac ttc ate gae tgg 2880 
Tyr Val Ala He Ala Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 

etg acc cea tec tec tec tgc tgc cgc att tat ace cgt ggc ccc cat 2928 
Leu Thr Pro Ser Ser Ser Cys Cys Arg He Tyr Thr Arg Gly Pro His 

965 970 975 



aaa gat gag ttc tgt ccc tea aeg gat act tec ttc aac tgt etc aaa 



2976 
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Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 

980 985 990 



aac tgc atg aac cgc act ctg ggt ccc gtg aga ccc aca aca gaa cag 
Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Thr Glu Gin 
995 1000 1005 



3024 



ttt cat aag tac ctg ccc tgg ttc ctg aat gat acg ccc aac ate 
Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Thr Pro Asn lie 
1010 1015 1020 



3069 



aga tgt cct aaa ggg ggc eta gca gcg tat aga acc tot gtg aat 
Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 

ttg ago tea gat gge eag att ata gee tee eag ttc atg gee tae 
Leu Ser Ser Asp Gly Gin lie lie Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 



3114 



3159 



cac aag ccc tta egg aac tea cag gac ttt aca gaa get etc egg 
His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 



3204 



gca tec egg ttg eta gca gee aac ate aca get gaa eta egg aag 
Ala Ser Arg Leu Leu Ala Ala Asn lie Thr Ala Glu Leu Arg Lys 
1070 1075 1080 



3249 



gtg cet ggg aca gat ecc aae ttt gag gte tte eet tae aeg ate 
Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr lie 
1085 1090 1095 



3294 



tee aat ' gtg tte tae cag caa tae ctg aeg gtt etc eet gag gga 
Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 



3339 



ate ttc act ett get etc tgc ttc gtg ccc acc ttt gtg gte tgc 
lie Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 



3384 



tac etc eta ctg ggc ctg gac ata cgc tea ggc ate etc aac ctg 
Tyr Leu Leu Leu Gly Leu Asp lie Arg Ser Gly lie Leu Asn Leu 
1130 1135 1140 



3429 



etc tee ate att atg ate etc gtg gac acc ate ggc etc atg get 
Leu Ser lie lie Met He Leu Val Asp Thr He Gly Leu Met Ala 
1145 1150 1155 



3474 



gtg tgg ggt ate age tae aat get gtg tec etc ate aac ett gte 
Val Trp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 



3519 



acg gca gtg ggc atg tct gtg gag ttc gtg tec cac att acc egg 
Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 



3564 



tec ttt get gta age acc aag cct acc egg ctg gag aga gee aaa 
Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 1200 



3609 



gat get act ate tte atg gge agt gcg gtg ttt get gga gtg gee 
Asp Ala Thr He Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 



3654 
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1205 1210 1215 

atg acc aac ttc ccg ggc ate etc ate etg gge ttt get eag gee 3699 
Met Thr Asn Phe Pro Gly lie Leu lie Leu Gly Phe Ala Gin Ala 
1220 1225 1230 

eag ett ate eag att tte tte tte ege ete aae ete etg ate ace 3744 
Gin Leu lie Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 1245 

ttg etg ggt etg eta eac ggc etg gtc ttc etg ccc gtt gtc etc 3789 
Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 

age tat etg ggg cca gat gtt aac caa get etg gta etg gag gag 3834 
Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Leu Glu Glu 
1265 1270 1275 

aaa eta gee act gag gea gee atg gtc tea gag cet tet tgc cca 3879 
Lys Leu Ala Thr Glu Ala Ala Met Val Ser Glu Pro Ser Cys Pro 
1280 1285 1290 

cag tac ccc ttc ccg get gat gca aac acc agt gac tat gtt aac 3924 
Gin Tyr Pro Phe Pro Ala Asp Ala Asn Thr Ser Asp Tyr Val Asn 
1295 1300 1305 

tac ggc ttt aat cca gaa ttt ate cct gaa att aat get get age 3969 
Tyr Gly Phe Asn Pro Glu Phe He Pro Glu He Asn Ala Ala Ser 
1310 1315 1320 

age tct etg ccc aaa agt gac caa aag ttc taa 4002 
Ser Ser Leu Pro Lys Ser Asp Gin Lys Phe 
1325 1330 



<210> 12 

<211> 1333 

<212> PRT 

<213> Mus sp 



<400> 12 

Met Ala Ala Ala Trp Gin Gly Trp Leu Leu Trp Ala Leu Leu Leu Asn 
15 10 15 



Ser Ala Gin Gly Glu Leu Tyr Thr Pro Thr His Lys Ala Gly Phe Cys 

20 25 30 



Thr Phe Tyr Glu Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Gly Leu 
35 40 45 



Thr Ser Leu Ser Asn He Ser Cys Leu Ser Asn Thr Pro Ala Arg His 
50 55 60 



wo 2004/009772 



40/73 



PCT/US2003/022467 



Val Thr Gly Asp His Leu Ala Leu Leu Gin Arg Val Cys Pro Arg Leu 
65 70 75 80 



Tyr Asn Gly Pro Asn Asp Thr Tyr Ala Cys Cys Ser Thr Lys Gin Leu 

85 90 95 



Val Ser Leu Asp Ser Ser Leu Ser He Thr Lys Ala Leu Leu Thr Arg 

100 105 110 



Cys Pro Ala Cys Ser Glu Asn Phe Val Ser He His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asp Gin Ser Leu Phe He Asn Val Thr Arg Val Val Gin 
130 135 140 



Arg Asp Pro Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 



Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys Ser Arg Val Arg 

165 170 175 



He Pro Ala Ala Ala Ser Leu Ala Val Gly Ser Met Cys Gly Val Tyr 

180 185 190 



Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 



Thr Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu 
210 215 220 



Pro Gly Gin Ala Leu Ala Asp Gly Met Lys Pro Leu Asp Gly Lys He 
225 230 235 240 



Thr Pro Cys Asn Glu Ser Gin Gly Glu Asp Ser Ala Ala Cys Ser Cys 

245 250 255 



Gin Asp Cys Ala Ala Ser Cys Pro Val He Pro Pro Pro Pro Ala Leu 

260 265 270 



Arg Pro Ser Phe Tyr Met Gly Arg Met Pro Gly Trp Leu Ala Leu He 
275 280 285 



He He Phe Thr Ala Val Phe Val Leu Leu Ser Val Val Leu Val Tyr 
290 295 300 
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Leu Arg Val Ala Ser Asn Arg Asn Lys Asn Lys Thr Ala Gly Ser Gin 
305 310 315 320 



Glu Ala Pro Asn Leu Pro Arg Lys Arg Arg Phe Ser Pro His Thr Val 

325 330 335 



Leu Gly Arg Phe Phe Glu Ser Trp Gly Thr Arg Val Ala Ser Trp Pro 

340 345 350 



Leu Thr Val Leu Ala Leu Ser Phe lie Val Val lie Ala Leu Ser Val 
355 360 365 



Gly Leu Thr Phe lie Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser 
370 375 380 



Ala Pro Lys Ser Gin Ala Arg Lys Glu Lys Ala Phe His Asp Glu His 

385 390 395 400 



Phe Gly Pro Phe Phe Arg Thr Asn Gin lie Phe Val Thr Ala Lys Asn 

405 410 415 



Arg Ser Ser Tyr Lys Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe 

420 425 430 



Ser Gly lie Leu Ser Leu Asp Leu Leu Gin Glu Leu Leu Glu Leu Gin 
435 440 445 



Glu Arg Leu Arg His Leu Gin Val Trp Ser His Glu Ala Gin Arg Asn 
450 455 460 



lie Ser Leu Gin Asp lie Cys Tyr Ala Pro Leu Asn Pro His Asn Thr 
465 470 475 480 



Ser Leu Thr Asp Cys Cys Val Asn Ser Leu Leu Gin Tyr Phe Gin Asn 

485 490 495 



Asn His Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Asn Gly Gin 

500 505 510 



Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys Ala Asn Ala 
515 520 525 



Pro Leu Thr Ty^r Lys Asp Gly Thr Ala Leu Ala Leu Ser Cys lie Ala 
530 535 540 
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Asp Tyr Gly Ala 
545 



Gly Thr Asp Tyr 



Asn Asn Tyr Pro 

580 



Glu Glu Ala Phe 
595 



Asp Lys Phe Gin 
610 



lie Asn Arg Thr 
625 



Leu He Val Phe 



Trp Ser Arg Val 

660 



Val Ala Val Val 
675 



Tyr Leu Gly Val 
690 



Leu Val Leu Ala 
705 



Tyr Gin Arg Leu 



Gly Arg Thr Leu 

740 



Ser Glu Ala He 
755 



Val Arg Thr Phe 
770 



Pro Val £>he Pro 
550 



Ser Glu Ala Glu 

565 



Ala Asp Asp Pro 



Leu Lys Glu Met 

600 



He Ala Phe Ser 
615 



Thr He Gin Asp 
630 



Leu Tyr He Ser 
645 



Ala Val Asp Ser 



Leu Gly Ala Val 

680 



Pro Ser Ser Leu 

695 



Val Gly Ala Asp 
710 



Pro Arg Met Pro 
725 



Gly Ser Val Ala 



Cys Phe Phe Leu 

760 



Ala Leu Thr Ser 
775 



42/73 

Phe Leu Ala Val 
555 



Ala Leu He He 
570 



Arg Met Ala His 
585 



Gin Ser Phe Gin 



Ala Glu Arg Ser 

620 



Leu Pro Val Phe 
635 



Leu Ala Leu Gly 
650 



Lys Ala Thr Leu 
665 



Val Ala Ala Met 



Val He He Gin 

700 



Asn He Phe He 
715 



Gly Glu Gin Arg 
730 



Pro Ser Met Leu 
745 



Gly Ala Leu Thr 



Gly Leu Ala He 

780 



Gly Gly Tyr Gin 

560 



Thr Phe Ser He 
575 



Ala Lys Leu Trp 
590 



Arg Ser Thr Ala 
605 



Leu Glu Asp Glu 



Ala He Ser Tyr 

640 



Ser Tyr Ser Arg 
655 



Gly Leu Gly Gly 
670 



Gly Phe Tyr Ser 
685 



Val Val Pro Phe 



Phe Val Leu Glu 

720 



Glu Ala His He 
735 



Leu Cys Ser Leu 
750 



Ser Met Pro Ala 
765 



He Phe Asp Phe 



Leu Leu Gin Met Thr Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
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785 790 795 800 

Arg Gin Glu Ala Ser Arg Pro Asp Val Val Cys Cys Phe Ser Ser Arg 

805 810 815 

Asn Leu Pro Pro Pro Lys Gin Lys Glu Gly Leu Leu Leu Cys Phe Phe 

820 825 830 

Arg Lys He Tyr Thr Pro Phe Leu Leu His Arg Phe He Arg Pro Val 
835 840 845 

Val Leu Leu Leu Phe Leu Val Leu Phe Gly Ala Asn Leu Tyr Leu Met 
850 855 860 

Cys Asn He Ser Val Gly Leu Asp Gin Asp Leu Ala Leu Pro Lys Asp 
865 870 875 880 

Ser Tyr Leu He Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Leu Glu Val 

885 890 895 

Gly Pro Pro Val Tyr Phe Asp Thr Thr Ser Gly Tyr Asn Phe Ser Thr 

900 905 910 

Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Glu Ser Phe 
915 920 925 

Ser Leu Thr Gin Lys He Gin Tyr Ala Ser Glu Phe Pro Asn Gin Ser 
930 935 940 

Tyr Val Ala He Ala Ala Ser Ser Txp Val Asp Asp Phe He Asp Trp 
945 950 955 960 

Leu Thr Pro Ser Ser Ser Cys Cys Arg He Tyr Thr Arg Gly Pro His 

965 970 975 

Lys Asp Glu Phe Cys Pro Ser Thr Asp Thr Ser Phe Asn Cys Leu Lys 

980 985 990 

Asn Cys Met Asn Arg Thr Leu Gly Pro Val Arg Pro Thr Thr Glu Gin 
995 1000 1005 

Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Thr Pro Asn He 
1010 1015 1020 



Arg Cys Pro Lys Gly Gly Leu Ala Ala Tyr Arg Thr Ser Val Asn 
1025 1030 1035 
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Leu Ser Ser Asp Gly Gin He He Ala Ser Gin Phe Met Ala Tyr 
1040 1045 1050 



His Lys Pro Leu Arg Asn Ser Gin Asp Phe Thr Glu Ala Leu Arg 
1055 1060 1065 



Ala Ser Arg Leu Leu Ala Ala Asn lie Thr Ala Glu Leu Arg Lys 
1070 1075 1080 



Val Pro Gly Thr Asp Pro Asn Phe Glu Val Phe Pro Tyr Thr He 
1085 1090 1095 



Ser Asn Val Phe Tyr Gin Gin Tyr Leu Thr Val Leu Pro Glu Gly 
1100 1105 1110 



He Phe Thr Leu Ala Leu Cys Phe Val Pro Thr Phe Val Val Cys 
1115 1120 1125 



Tyr Leu Leu Leu Gly Leu Asp He Arg Ser Gly He Leu Asn Leu 
1130 1135 1140 



Leu Ser He He Met He Leu Val Asp Thr He Gly Leu Met Ala 
1145 1150 1155 



Val Trp Gly He Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val 
1160 1165 1170 



Thr Ala Val Gly Met Ser Val Glu Phe Val Ser His He Thr Arg 
1175 1180 1185 



Ser Phe Ala Val Ser Thr Lys Pro Thr Arg Leu Glu Arg Ala Lys 
1190 1195 1200 



Asp Ala Thr He Phe Met Gly Ser Ala Val Phe Ala Gly Val Ala 
1205 1210 1215 



Met Thr Asn Phe Pro Gly He Leu He Leu Gly Phe Ala Gin Ala 
1220 1225 1230 



Gin Leu He Gin He Phe Phe Phe Arg Leu Asn Leu Leu He Thr 
1235 1240 1245 



Leu Leu Gly Leu Leu His Gly Leu Val Phe Leu Pro Val Val Leu 
1250 1255 1260 
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Ser Tyr Leu Gly Pro Asp Val Asn Gin Ala Leu Val Leu Glu Glu 

1265 1270 1275 

Lys Leu Ala Thr Glu Ala Ala Met Val Ser Glu Pro Ser Cys Pro 

1280 1285 1290 

Gin Tyr Pro Phe Pro Ala Asp Ala Asn Thr Ser Asp Tyr Val Asn 

1295 1300 1305 

Tyr Gly Phe Asn Pro Glu Phe lie Pro Glu lie Asn Ala Ala Ser 

1310 1315 1320 

Ser Ser Leu Pro Lys Ser Asp Gin Lys Phe 

1325 1330 

<210> 13 
<211> 3999 
<212> DNA 
<213> Mus sp. 



<400> 13 

atggcngcng cntggcargg ntggytnytn tgggcnytny tnytnaayws ngcncarggn 60 

garytntaya cnccnacnca yaargcnggn ttytgyacnt tytaygarga rtgyggnaar 120 

aayccngary tnwsnggngg nytnacnwsn ytnwsnaaya thwsntgyyt nwsnaayacn 180 

ccngcnmgnc aygtnacngg ngaycayytn gcnytnytnc armgngtntg yccnmgnytn 240 

tayaayggnc cnaaygayac ntaygcntgy tgywsnacna arcarytngt nwsnytngay 3 00 

wsnwsnytnw snathacnaa rgcnytnytn acnmgntgyc cngcntgyws ngaraaytty 3 60 

gtnwsnathc aytgycayaa yacntgywsn ccngaycarw snytnttyat haaygtnacn 420 

mgngtngtnc armgngaycc nggncarytn ccngcngtng tngcntayga rgcnttytay 480 

carmgnwsnt tygcngaraa rgcntaygar wsntgywsnm gngtnmgnat hccngcngcn 540 

gcnwsnytng cngtnggnws natgtgyggn gtntayggnw sngcnytntg yaaygcncar 600 

mgntggytna ayttycargg ngayacnggn aayggnytng cnccnytnga yathacntty 660 

cayytnytng arccnggnca rgcnytngcn gayggnatga arccnytnga yggnaarath 720 

acnccntgya aygarwsnca rggngargay wsngcngcnt gywsntgyca rgaytgygcn 780 

gcnwsntgyc cngtnathcc nccnccnccn gcnytnmgnc cnwsnttyta yatgggnmgn 840 
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ccnaarcara 


argarggnyt 


nytnytntgy 


ttyttymgna 


ytncaymgnt 


tyathmgncc 


ngtngtnytn 


ytnytnttyy 


ytntayytna 


tgtgyaayat 


hwsngtnggn 


ytngaycarg 
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tnttygtnyt 


nytnwsngtn 


900 


ayaaracngc 


nggnwsncar 


960 


ayacng tny t 


nggnmgntty 


1020 


cngtnytngc 


nytnwsntty 


1080 


arytnacnac 


ngayeengtn 


1140 


argent tyca 


ygaygarcay 


1200 


cnaaraavmci 


nwsnwsntay 


1260 


cmatlivtnws 


nytngayytn 


1320 


tncarcftntcT 


Qwsnc ay aar 


1380 


env t naave c 


nc ay aay a cn 


1440 


t ve ar aavaa 


y c ay ac ny t n 


1500 


snytngtnga 


ytggaargay 


1560 


ayggnaenge 

,^3 ^3 ^ ,^3 — 


nytngcnytn 


1620 


tnacngtncig 

^^•"53 ^^ 53 ^^ 53 53 


nggn t ay c ar 


1680 


tvwsna t h.aa 


yaay t ay c cn 


1740 


araen t tyy t 


naargaratg 


1800 


en 1 1 vwsncr e 


ngarmgnwsn 


1860 


cnQtnt tvcfc 


natliwsntav 


1920 


avws nmcm t cr 


awsnmcrnq'tn 


1980 


cna tncT tnvt 


ncTcrncrcncftn 


2040 


snwsnvtncft 


natliatlicar 


2100 


thttyathtt 


ygtnytngar 


2160 


encayathgg 


nmgn ac ny tn 


2220 


argcnathtg 


yttyttyytn 


2280 


tnaenwsngg 


nytngcnath 


2340 


tnytnwsnyt 


ngaywsnaar 


2400 


snwsnmgnaa 


yytnccnccn 


2460 


arathtayae 


nccnttyytn 


2520 


tngtnytntt 


yggngcnaay 


2580 


ayytngcnyt 


nccnaargay 


2640 
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wsntayyfcna 


thgaytaytt 


yytnttyytn 


aaymgntayy 


tngargtngg 


nccnccngtn 


2700 


tayttygaya 


cnacnwsngg 


ntayaaytty 


wsnacngarg 


cnggnatgaa 


ygcnathtgy 


2760 


wsnwsngcng 


gntgygarws 


nttywsnytn 


acncaraara 


thcartaygc 


nwsngartty 


2820 


ccnaaycarw 


sntaygtngc 


nathgcngcn 


wsnwsntggg 


tngaygaytt 


yathgaytgg 


2880 


ytnacnccnw 


snwsnwsntg ytgymgnath. 


tayacnmgng 


gnccncayaa 


rgaygartty 


2940 


tgyccnwsna 


cngayacnws 


nttyaaytgy 


ytnaaraayt 


gyatgaaymg 


na cny t nggn 


3000 


ccngtnmgnc 


cnacnacnga 


rcarttycay 


aartayytnc 


cntggttyyt 


naaygayacn 


3060 


ccnaayathm gntgyccnaa 


rggnggnytn 


gcngcntaym gnacnwsngt 


r 

na ayy tnwsn 


3120 


wsngayggnc 


arathathgc 


nwsncartty 


atggcntayc 


ayaarccnyt 


nmgnaaywsn 


3180 


cargayttya 


cngargcnyt 


nmgngcnwsn 


mgnytnytng 


cngcnaayat 


hacngcngar 


3240 


ytnmgnaarg 


tnccnggnac 


ngayccnaay 


ttygargtnt 


tyccntayac 


nathwsnaay 


3300 


crtnttvtavc 


aire air t aw t 


nacngtnytn 


c cngarggna 


thfcfcyacnyt 


ngcnytntgy 


3360 


ttygtnccna 


cnttygtngt 


ntgytayytn 


ytnytnggny 


tngayathmg 


nwsnggnath 


3420 


ytnaayytny 


tnwsnatliat 


hatgathytn 


gtngayacna 


thggnytnat 


ggcngtntgg 


3480 


ggnathwsnt 


ayaaygcngt 


nwsnytnath 


aayytngtna 


cngcngtngg 


natgwsngtn 


3540 


garttygtnw 


sncayathac 


mngnwsntty 


gcngtnwsna 


cnaarccnac 


nmgnytngar 


3600 


mgngcnaarg 


aygcnacnat 


httyatgggn 


wsngcngtnt 


tygcnggngt 


ngcnatgacn 


3660 


aayttyccng 


gnathytnat 


hytnggntty 


gcncargcnc 


arytnathca 


rathttytty 


3720 


ttymgnytna 


ayytnytnat 


hacny t ny tn 


ggnytnytnc 


ayggnytngt 


nttyytnccn 


3780 


gtngtnytnw 


sntayytngg 


nccngaygtn 


aaycargcny 


tngtnytnga 


rgaraarytn 


*3 o yi rt 


gcnacngarg 


cngcnatggt 


nwsngarccn 


wsntgyccnc 


artayccntt 


yccngcngay 


3900 


gcnaayacnw 


sngaytaygt 


naaytayggn 


ttyaayccng 


arttyathcc 


ngarathaay 


3960 


gcngcnwsnw 


snwsnytncc 


naarwsngay 


caraartty 






3999 



<210> 14 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer 
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<400> 14 

tcttcaccct tgctctttgc 



<210> 15 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



20 



<220> 

<223> primer 

<400> 15 

aatgatggag agtaggttga ggat 24 

<210> 16 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 16 

tgcccacctt tgttgtctgc taccta 26 

<210> 17 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 17 

atcgctgaca ggatgcagaa g 21 



<210> 18 
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<211> 22 
<212> DNA 

<213> Artificial Secjuence 



<220> 

<223> primer 

<400> 18 

tcaggaggag caatgatctt ga 

<210> 19 

<211> 30 

<212> DNA 

<213> Artificial Secjuence 



<220> 

<223> primer 

<400> 19 

agattactgc cctggctcct agcaccatta 

<210> 20 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 20 

atcctcatcc tgggctttgc 

<210> 21 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer 

<400> 21 

gcaaggtgat caggaggttg a 

<210> 22 

<211> 29 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 22 

cccagcttat ccagattttc ttcttccgc 29 

<210> 23 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 23 

tcttcaccct tgctctttgc 

<210> 24 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> primer 
<400> 24 
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aatgatggag agtaggttga ggat 24 

<210> 25 

<211> 24 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 25 

tgcccacctt tgttgtctgc tacc 24 

<210> 26 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 26 

agcacctgtc cactgaagat ttc 23 

<210> 27 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 27 

tggacgctga gcttcagttc t 21 



<210> 28 
<211> 24 



wo 2004/009772 PCT/US2003/022467 

52/73 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 28 

cttctctgcg ctgcctcgat ggaa 24 

<210> 29 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 29 

agtaaaaagg gctcgcagga t 21 

<210> 30 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 30 

ggcagctggt gacatcagag a 21 

<210> 31 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> primer 

<400> 31 

aggaggccat gcaggcctac tctga 25 

<210> 32 

<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> primer 

<400> 32 

gagtccacgg tcagtccatg t 21 

<210> 33 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 

<400> 33 

ttatgaacaa caatgccaag caa • 23 

<210> 34 

<211> 34 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> primer 
<400> 34 

agtccttagg tagtggctta gtccctggaa gctc 34 



wo 2004/009772 



54/73 



PCT/US2003/022467 



<210> 35 

<211> 52 

<212> DNA 

<213> Artificial Secxuence 



<220> 

<223> probe 

<400> 35 

gtaatacgac tcactatagg gccctgacgg tccttcctga gggaatcttc ac 52 

<210> 36 

<211> 50 

<212> DNA 

<213> Artificial Secjuence 



<220> 

<223> probe 

<400> 36 

gtaatacgac tcactatagg gcctgggaag ttggtcatgg ccactccagc 50 

<210> 37 

<211> 8 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> FLAG tag 

<400> 37 

Asp Tyr Lys Asp 
1 



Asp Asp Asp Lys 
5 



<210> 38 
<211> 4 
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<212> PRT 

<213> Artificial Sequence 



<220> 

<223> motif 

<400> 38 

Tyr Gin Arg Leu 
1 

<210> 39 

<211> 19 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> antigen 
<400> 39 

Glu Gin Phe His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Pro Pro Asn 
15 10 15 

lie Arg Cys 



<210> 40 

<211> 16 

<212> PRT 

<213> Artificial Sequence 



<220> 

<2 2 3> antigen 
<400> 40 

Glu Ala Phe Tyr Gin Arg Ser Phe Ala Glu Lys Ala Tyr Glu Ser Cys 
15 10 15 
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<210> 41 
<211> 15 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> antigen 
<400> 41 

Gly Gin Thr Ser Leu Val Asp Trp Lys Asp His Phe Leu Tyr Cys 
15 10 15 

<210> 42 

<211> 17 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> antigen 
<400> 42 

Cys Ala Asn Ala Pro Leu Thr Phe Lys Asp Gly Thr Ala Leu Ala Leu 
15 10 15 

Ser 

<210> 43 

<211> 5092 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (57) . . (4136) 

<223> 
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<400> 43 

cttggctgtt cctgaggcct ggcctggctc cccgctgacc ccttcccaga cctggg atg 59 

Met 
1 



gcg gag gcc ggc ctg agg ggc tgg ctg ctg tgg gcc ctg etc ctg cgc 
Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu Arg 

5 10 15 



gcc ttc tat gac gaa tgt ggg. aag aac oca gag ctg tct gga age ete 
Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser Leu 
35 40 45 



ate aea ggt gat eae etg ate eta tta eag aag ate tge ecc cgc etc 
lie Thr Gly Asp His Leu lie Leu Leu Gin Lys lie Cys Pro Arg Leu 

70 75 80 



eea gee tge tet gae aat ttt gtg aae etg eae tge eae aae aeg tge 
Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr Cys 
115 120 125 

age eee aat eag age etc ttc ate aat gtg acc cgc gtg gcc cag eta 
Ser Pro Asn Gin Ser Leu Phe He Asn Val Thr Arg Val Ala Gin Leu 
130 135 140 145 



eat age ttt gcc gag cag age tat gae tee tge age egt gtg ege gte 
His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg Val 

165 170 175 



tet gee ett tge aat gee eag ege tgg ete aae ttc cag gga gac aca 
Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp Thr 
195 200 205 



107 



ttg gcc cag agt gag cct tac aca acc ate cac cag cct ggc tac tge 155 
Leu Ala Gin Ser Glu Pro Tyr Thr Thr He His Gin Pro Gly Tyr Cys 
20 25 30 



203 



atg aea ete tee aae gtg tee tge etg tee aae aeg eeg gee ege aag 251 
Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg Lys 
50 55 60 65 



299 



tac acc ggc ccc aac acc caa gcc tge tge tec gcc aag cag ctg gta 347 
Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu Val 

85 90 95 

tea ctg gaa gcg agt ctg teg ate aee aag gee ete ete aee ege tge 395 
Ser Leu Glu Ala Ser Leu Ser He Thr Lys Ala Leu Leu Thr Arg Cys 
100 105 110 



443 



491 



ggg get gga caa etc cca get gtg gtg gcc tat gag gcc ttc tac cag 539 
Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr Gin 

150 155 160 



587 



eet gea get gee aeg etg get gtg gge aee atg tgt gge gtg tat gge 635 
Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr Gly 
180 185 190 



683 



ggc aat ggt ctg gcc cca ctg gac ate acc ttc cac etc ttg gag cct 
Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu Pro 



731 
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210 215 220 225 

ggc cag gcc gtg ggg agt ggg att cag cct ctg aat gag ggg gtt gca 77 9 

Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val Ala 

o/in 



230 235 240 



cgt tgc aat gag tec caa ggt gac gac gtg gcg acc tgc tec tgc caa 
Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys Gin 

245 250 255 



ggc ccc ttc ttc cga acc aac cag gtg ate ctg aeg get cct aac egg 
Gly Pro Phe Phe Arg Thr Asn Gin Val He Leu Thr Ala Pro Asn Arg 

405 410 415 



827 



gac tgt get gea tee tgt eet gcc ata gcc cgc ccc cag gee etc gac 875 
Asp Cys Ala Ala Ser Cys Pro Ala He Ala Arg Pro Gin Ala Leu Asp 
260 265 270 

tec acc ttc tac ctg ggc cag atg ccg ggc agt ctg gtc etc ate ate 923 
Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu He He 
275 280 285 

ate etc tgc tet gtc ttc get gtg gtc acc ate ctg ett gtg gga ttc 971 
He Leu Cys Ser Val Phe Ala Val Val Thr He Leu Leu Val Gly Phe 
290 295 300 305 

cgt gtg gcc ccc gee agg gac aaa age aag atg, gtg gac ccc aag aag 1019 
Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys Lys 

310 315 320 

ggc ace age etc tet gac aag etc age ttc tec acc cac acc etc ett 1067 
Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu Leu 

325 330 335 

ggc cag ttc ttc cag ggc tgg ggc aeg tgg gtg get teg tgg cct ctg 1115 
Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro Leu 
340 345 350 

acc ate ttg gtg eta tet gtc ate ccg gtg gtg gee ttg gea gcg ggc 1163 
Thr He Leu Val Leu Ser Val He Pro Val Val Ala Leu Ala Ala Gly 
355 360 365 

ctg gtc ttt aca gaa etc act aeg gac ccc gtg gag ctg tgg teg gee 1211 
Leu Val Phe Thr Glu Leu Thr Thr Asp Pro Val Glu Leu Trp Ser Ala 
370 375 380 385 

ccc aae age eaa gee egg agt gag aaa get ttc cat gac cag cat ttc 1259 
Pro Asn Ser Gin Ala Arg Ser Glu Lys Ala Phe His Asp Gin His Phe 

390 395 400 



1307 



tec age tac agg tat gac tet ctg ctg ctg ggg ccc aag aac ttc age 1355 
Ser Ser Tyr Arg Tyr Asp Ser Leu Leu Leu Gly Pro Lys Asn Phe Ser 
420 425 430 

gga ate ctg gac ctg gac ttg ctg ctg gag ctg eta gag ctg cag gag 1403 
Gly He Leu Asp Leu Asp Leu Leu Leu Glu Leu Leu Glu Leu Gin Glu 
435 440 445 

agg ctg egg cac etc cag gta tgg teg ccc gaa gea cag cgc aac ate 1451 
Arg Leu Arg His Leu Gin Val Trp Ser Pro Glu Ala Gin Arg Asn He 
450 455 460 465 



4 
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tec ctg cag gac ate tge tae gee ccc etc aat ccg gae aat ace agt 1499 
Ser Leu Gin Asp He Cys Tyr Ala Pro Leu Asn Pro Asp Asn Thr Ser 

470 475 480 



etc tac gac tge tgc ate aac age ete ctg cag tat ttc cag aac aac 
Leu Tyr Asp Cys Cys He Asn Ser Leu Leu Gin Tyr Phe Gin Asn Asn 

485 490 495 



1547 



cgc acg etc ctg ctg etc aca gee aac cag aca ctg atg ggg cag ace 1595 
Arg Thr Leu Leu Leu Leu Thr Ala Asn Gin Thr Leu Met Gly Gin Thr 
500 505 510 



tec caa gtc gac tgg aag gac eat ttt 
Ser Gin Val Asp Trp Lys Asp His Phe 
515 520 

etc ace ttc aag gat gge aea gee ctg 
Leu Thr Phe Lys Asp Gly Thr Ala Leu 
530 535 

tae ggg gee eet gtc ttc ccc ttc ctt 
Tyr Gly Ala Pro Val Phe Pro Phe Leu 

550 

aag gac tat tct gag gca gag gcc ctg 
Lys Asp Tyr Ser Glu Ala Glu Ala Leu 

565 570 



ctg tac tgt gee aat gee eeg 1643 
Leu Tyr Cys Ala Asn Ala Pro 
525 

gee ctg age tge atg get gae 1691 
Ala Leu Ser Cys Met Ala Asp 
540 545 

gcc att ggg ggg tac aaa gga 1739 

Ala He Gly Gly Tyr Lys Gly 
555 560 

ate atg acg ttc tec etc aac 1787 
He Met Thr Phe Ser Leu Asn 

575 



aat tac cct gee ggg gae eee egt ctg gee cag gee aag ctg tgg gag 1835 
Asn Tyr Pro Ala Gly Asp Pro Arg Leu Ala Gin Ala Lys Leu Trp Glu 
580 585 590 



gag gee tte tta gag gaa atg ega gee 
Glu Ala Phe Leu Glu Glu Met Arg Ala 
595 600 

atg tte cag gtc acg ttc atg get gag 
Met Phe Gin Val Thr Phe Met Ala Glu 
610 615 

aat cgc acc aca get gaa gac ctg ccc 
Asn Arg Thr Thr Ala Glu Asp Leu Pro 

630 



tte eag egt egg atg get gge 1883 
Phe Gin Arg Arg Met Ala Gly 
605 

egc tct ctg gaa gac gag ate 1931 
Arg Ser Leu Glu Asp Glu He 
620 625 

ate ttt gcc acc age tac att 1979 
He Phe Ala Thr Ser Tyr He 
635 640 



gtc ata ttc ctg tae ate tct ctg gee ctg gge age tat tee age tgg 
Val He Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Ser Trp 

645 650 655 



2027 



age ega gtg atg gtg gac tee aag gee acg ctg gge ete gge ggg gtg 2075 
Ser Arg Val Met Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly Val 
660 665 670 



gee gtg gtc ctg gga gca gtc atg get gee atg gge ttc ttc tec tac 2123 
Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser Tyr 
675 680 685 

ttg ggt ate cgc tec tec ctg gtc ate ctg caa gtg gtt cct ttc ctg 2171 
Leu Gly He Arg Ser Ser Leu Val He Leu Gin Val Val Pro Phe Leu 
690 695 700 705 
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gtg ctg tec gtg ggg get gat aae ate tte ate ttt gtt etc gag tae 2219 
Val Leu Ser Val Gly Ala Asp Asn lie Phe lie Phe Val Leu Glu Tyr 

710 715 720 

eag agg etg eee egg agg eet ggg gag cca cga gag gte eae att ggg 2267 
Gin Arg Leu Pro Arg Arg Pro Gly Glu Pro Arg Glu Val His lie Gly 

725 730 735 

ega gee eta gge agg gtg get eee age atg etg ttg tge age ete tet 2315 
Arg Ala Leu Gly Arg Val Ala Pro Ser Met Leu Leu Cys Ser Leu Ser 
740 745 750 

gag gee ate tgc ttc ttc eta ggg gee etg ace ccc atg cca get gtg 2363 
Glu Ala lie Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala Val 
755 760 765 

egg acc ttt gcc ctg acc tct gge ett gca gtg ate ett gae tte ete 2411 
Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala Val He Leu Asp Phe Leu 
770 775 780 785 

ctg eag atg tea gee ttt gtg gee etg ete tee etg gae age aag agg 2459 
Leu Gin Met Ser Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys Arg 

790 795 800 

eag gag gee tee egg ttg gae gte tge tge tgt gte aag eee eag gag 2507 
Gin Glu Ala Ser Arg Leu Asp Val Cys Cys Cys Val Lys Pro Gin Glu 

805 810 815 

etg eee ceg cet gge cag gga gag ggg etc etg ett gge ttc ttc caa 255 5 

Leu Pro Pro Pro Gly Gin Gly Glu Gly Leu Leu Leu Gly Phe Phe Gin 
820 825 830 

aag get tat gcc eee ttc ctg ctg eae tgg ate act cga ggt gtt gtg 2603 
Lys Ala Tyr Ala Pro Phe Leu Leu His Trp He Thr Arg Gly Val Val 
835 840 845 

etg ctg etg ttt ete gee etg tte gga gtg age ete tae tee atg tge 2651 
Leu Leu Leu Phe Leu Ala Leu Phe Gly Val Ser Leu Tyr Ser Met Cys 
850 855 860 865 

eae ate age gtg gga etg gae eag gag etg gee etg eee aag gae teg 2699 
His He Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp Ser 

870 875 880 

tae etg ett gae tat ttc etc ttt ctg aae cgc tae ttc gag gtg ggg 2747 
Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val Gly 

885 890 895 

gcc ccg gtg tac ttt gtt acc acc ttg gge tae aae ttc tec age gag 2795 
Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser Glu 
900 905 910 

get ggg atg aat gee ate tge tee agt gca gge tge aae aae tte tee 2843 
Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe Ser 
915 920 925 

tte aee eag aag ate eag tat gee aea gag tte eet gag eag tet tae 2891 
Phe Thr Gin Lys He Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser Tyr 
930 935 940 945 



etg gee ate eet gee tee tee tgg gtg gat gae tte att gae tgg etg 



2939 
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Leu Ala lie Pro Ala Ser Ser Trp Val Asp Asp Phe lie Asp Trp Leu 

950 955 960 



acc ccg tec tec tgc tgc cgc ctt tat ata tot ggc ccc aat aag gac 
Thr Pro Ser Ser Cys Cys Arg Leu Tyr lie Ser Gly Pro Asn Lys Asp 

965 970 975 



2987 



aag ttc tgc ccc teg acc gtc aac tct ctg aac tgc eta aag aac tgc 3 035 

Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn Cys 
980 985 990 

atg age ate aeg atg ggc tct gtg agg ccc teg gtg gag cag ttc cat 30 83 
Met Ser He Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe His 
995 1000 1005 

aag tat ctt ccc tgg ttc ctg aac gac egg ccc aac ate aaa tgt 3128 
Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn He Lys Cys 
1010 1015 1020 



ccc aaa ggc ggc ctg gea gea tac age ace tct gtg aac ttg act 
Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu Thr 
1025 1030 1035 



3173 



tea gat ggc cag gtt tta gac aca gtt gee att ctg tea ccc agg 
Ser Asp Gly Gin Val Leu Asp Thr Val Ala He Leu Ser Pro Arg 
1040 1045 1050 



3218 



ctg 
Leu 
1055 

ctg 
Leu 
1070 



gag tac agt ggc aca ate teg get cac tgc 
Glu Tyr Ser Gly Thr He Ser Ala His Cys 

1060 1065 



gat 
Asp 



tea 

Ser 



gee 
Ala 



tee 

Ser 



agg 
Arg 
1075 



ttc 
Phe 



atg 

Met 



gee 

Ala 



tat 

Tyr 



cac 
His 
1080 



aac etc tac etc 
Asn Leu Tyr Leu 



aag ccc ctg aaa 
Lys Pro Leu Lys 



3263 



3308 



aac tea cag gat tac aca gaa get ctg egg gea get ega gag ctg 
Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala Ala Arg Glu Leu 
1085 1090 1095 



3353 



gea gee aac ate act get gac ctg egg aaa gtg cet gga aca gac 
Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val Pro Gly Thr Asp 
1100 1105 1110 



3398 



ccg get ttt gag gtc ttc ccc tac aeg ate acc aat gtg ttt tat 
Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr Asn Val Phe Tyr 
1115 1120 1125 



3443 



gag cag tac ctg ace ate etc cet gag ggg etc ttc atg etc age 
Glu Gin Tyr Leu Thr He Leu Pro Glu Gly Leu Phe Met Leu Ser 
1130 1135 1140 



3488 



etc tgc ctt gtg ccc acc ttc get gtc tee tgc etc ctg ctg ggc 
Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys Leu Leu Leu Gly 
1145 1150 1155 



3533 



ctg 
Leu 
1160 



gac 
Asp 



ctg cgc tec 
Leu Arg Ser 



ggc 

Gly 
1165 



etc 
Leu 



etc 
Leu 



aac 
Asn 



ctg 
Leu 



etc 
Leu 
1170 



tec 
Ser 



att 
He 



gtc 
Val 



atg 
Met 



3578 



ate etc gtg gac act gtc ggc ttc atg gee ctg tgg ggc ate agt 
He Leu Val Asp Thr Val Gly Phe Met Ala Leu Trp Gly He Ser 



3623 
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1175 



1180 



1185 



tac aat get gtg tec etc ate aae etg gte teg geg gtg gge atg 
Tyr Asn Ala Val Ser Leu He Asn Leu Val Ser Ala Val Gly Met 
1190 1195 1200 

tet gtg gag ttt gtg tec cac att ace cgc tee ttt gee ate age 
Ser Val Glu Phe Val Ser His He Thr Arg Ser Phe Ala He Ser 
1205 1210 1215 

aee aag eee acc tgg etg gag agg gee aaa gag gcc acc ate tct 
Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu Ala Thr He Ser 
1220 1225 1230 

atg gga agt gcg gtg ttt gca ggt gtg gcc atg acc aae etg cet 
Met Gly Ser Ala Val Phe Ala Gly Val Ala Met Thr Asn Leu Pro 
1235 1240 1245 

ggc ate ett gtc etg gge etc gee aag gee eag etc att cag ate 
Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin Leu He Gin He 
1250 1255 1260 

ttc ttc ttc cgc etc aac etc etg ate act etg etg gge etg etg 
Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu Leu Gly Leu Leu 
1265 1270 1275 

eat gge ttg gtc ttc etg ccc gtc ate etc age tac gtg ggg cct 
His Gly Leu Val Phe Leu Pro Val He Leu Ser Tyr Val Gly Pro 
1280 1285 1290 

gac gtt aac ccg get etg gca etg gag cag aag egg get gag gag 
Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys Arg Ala Glu Glu 
1295 1300 1305 

gcg gtg gca gca gte atg gtg gee tct tge cea aat cac eee tee 
Ala val Ala Ala Val Met Val Ala Ser Cys Pro Asn His Pro Ser 
1310 1315 1320 

ega gtc tee aca get gac aae ate tat gte aae cac age ttt gaa 
Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn His Ser Phe Glu 
1325 1330 1335 

ggt tct ate aaa ggt get ggt gcc ate age aac ttc ttg ccc aac 
Gly Ser He Lys Gly Ala Gly Ala He Ser Asn Phe Leu Pro Asn 
1340 1345 1350 

aat ggg egg cag ttc tga tacagccaga ggccctgtct aggctctatg 
Asn Gly Arg Gin Phe 



3668 



3713 



3758 



3803 



3848 



3893 



3938 



3983 



4028 



4073 



4118 



4166 



1355 














gccctgaacc 


aaagggttat 


ggggatctte 


cttgtgaetg 


ceccttgaca 


caegecctec 


4226 


tcaaatccta 


ggggaggeca 


ttcccatgag 


actgcctgtc 


actggaggat 


ggcctgctct 


4286 


tgaggtatee 


aggcageace 


actgatgget 


cetetgetee 


eatagtgggt 


ceccagttte 


4346 


caagtcacct 


aggccttggg 


cagtgcctcc 


tcctgggeet 


gggtetggaa 


gttggcagga 


4406 


acagacacac 


teeatgtttg 


tcccacactc 


actcactttc 


ctaggagccc 


acttctcatc 


4466 


caacttttec 


cttcteagtt 


cetctctega 


aagtettaat 


tetgtgteag 


taagtettta 


4526 
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acacgtagca 


gtgtccctga 


gaacacagac 


aatgaccact 


accctgggtg 


tgatatcaca 


4586 


ggaggccaga 


gagaggcaaa 


ggctcaggcc 


aagagccaac 


gctgtgggag 


gccggtcggc 


4646 


agccactccc 


tccagggcgc 


acctgcaggt 


ctgccatcca 


cggccttttc 


tggcaagaga 


4706 


agggcccagg 


aaggatgctc 


tcataaggcc 


caggaaggat 


gctctcataa 


gcaccttggt 


4766 


catggattag 


cccctcctgg 


aaaatggtgt 


tgggtttggt 


ctccagctcc 


aatacttatt 


4826 


aaggctgttg 


ctgccagtca 


aggccaccca 


ggagtctgaa 


ggctgggagc 


tcttggggct 


4886 


gggctggtcc 


tcccatcttc 


acctcgggcc 


tggatcccag 


gcctcaaacc 


agcccaaccc 


4946 


gagcttttgg 


acagctctcc 


agaagcatga 


actgcagtgg 


agatgaagat 


cctggctctg 


5006 


tgctgtgcac 


ataggtgttt 


aataaacatt 


tgttggcaga 


aaaaaaaaaa 


aaaaaaaaaa 


5066 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaa 








5092 



<210> 44 

<211> 1359 

<212> PRT 

<213> Homo sapiens 

<400> 44 

Met Ala Glu Ala Gly Leu Arg Gly Trp Leu Leu Trp Ala Leu Leu Leu 
15 10 15 

Arg Leu Ala Gin Ser Glu Pro Tyr Thr Thr lie His Gin Pro Gly Tyr 

20 25 30 

Cys Ala Phe Tyr Asp Glu Cys Gly Lys Asn Pro Glu Leu Ser Gly Ser 
35 40 45 

Leu Met Thr Leu Ser Asn Val Ser Cys Leu Ser Asn Thr Pro Ala Arg 
50 55 60 

Lys lie Thr Gly Asp His Leu He Leu Leu Gin Lys He Cys Pro Arg 
65 70 75 80 

Leu Tyr Thr Gly Pro Asn Thr Gin Ala Cys Cys Ser Ala Lys Gin Leu 

85 90 95 



Val Ser Leu Glu Ala Ser Leu Ser He Thr Lys Ala Leu Leu Thr Arg 

100 105 110 
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Cys Pro Ala Cys Ser Asp Asn Phe Val Asn Leu His Cys His Asn Thr 
115 120 125 



Cys Ser Pro Asn Gin Ser Leu Phe lie Asn Val Thr Arg Val Ala Gin 
130 135 140 



Leu Gly Ala Gly Gin Leu Pro Ala Val Val Ala Tyr Glu Ala Phe Tyr 
145 150 155 160 



Gin His Ser Phe Ala Glu Gin Ser Tyr Asp Ser Cys Ser Arg Val Arg 

165 170 175 



Val Pro Ala Ala Ala Thr Leu Ala Val Gly Thr Met Cys Gly Val Tyr 

180 185 190 



Gly Ser Ala Leu Cys Asn Ala Gin Arg Trp Leu Asn Phe Gin Gly Asp 
195 200 205 



Thr Gly Asn Gly Leu Ala Pro Leu Asp He Thr Phe His Leu Leu Glu 
210 215 220 



Pro Gly Gin Ala Val Gly Ser Gly He Gin Pro Leu Asn Glu Gly Val 
225 230 235 240 



Ala Arg Cys Asn Glu Ser Gin Gly Asp Asp Val Ala Thr Cys Ser Cys 

245 250 255 



Gin Asp Cys Ala Ala Ser Cys Pro Ala He Ala Arg Pro Gin Ala Leu 

260 265 270 



Asp Ser Thr Phe Tyr Leu Gly Gin Met Pro Gly Ser Leu Val Leu He 
275 280 285 



He He Leu Cys Ser Val Phe Ala Val Val Thr He Leu Leu Val Gly 
290 295 300 



Phe Arg Val Ala Pro Ala Arg Asp Lys Ser Lys Met Val Asp Pro Lys 
305 310 315 320 



Lys Gly Thr Ser Leu Ser Asp Lys Leu Ser Phe Ser Thr His Thr Leu 

325 330 335 



Leu Gly Gin Phe Phe Gin Gly Trp Gly Thr Trp Val Ala Ser Trp Pro 

340 345 350 



wo 2004/009772 



PCT/US2003/022467 



Leu Thr lie Leu 

355 



Gly Leu Val Phe 
370 



Ala Pro Asn Ser 
385 



Phe Gly Pro Phe 



Arg Ser Ser Tyr 

420 



Ser Gly lie Leu 
435 



Glu Arg Leu Arg 
450 



lie Ser Leu Gin 
465 



Ser Leu Tyr Asp 



Asn Arg Thr Leu 

500 



Thr Ser Gin Val 
515 



Pro Leu Thr Phe 
530 



Asp Tyr Gly Ala 

545 



Gly Lys Asp Tyr 



Asn Asn Tyr Pro 

580 



Val Leu Ser Val 

360 



Thr Glu Leu Thr 
375 



Gin Ala Arg Ser 
390 



Phe Arg Thr Asn 
405 



Arg Tyr Asp Ser 



Asp Leu Asp Leu 

440 



His Leu Gin Val 
455 



Asp lie Cys Tyr 
470 



Cys Cys lie Asn 

485 



Leu Leu Leu Thr 



Asp Trp Lys Asp 

520 



Lys Asp Gly Thr 
535 



Pro Val Phe Pro 

550 



Ser Glu Ala Glu 
565 



Ala Gly Asp Pro 



65/73 
He Pro Val Val 



Thr Asp Pro Val 

380 



Glu Lys Ala Phe 
395 



Gin Val He Leu 
410 



Leu Leu Leu Gly 
425 



Leu Leu Glu Leu 



Trp Ser Pro Glu 

460 



Ala Pro Leu Asn 
475 



Ser Leu Leu Gin 
490 



Ala Asn Gin Thr 
505 



His Phe Leu Tyr 



Ala Leu Ala Leu 

540 



Phe Leu Ala He 
555 



Ala Leu He Met 
570 



Arg Leu Ala Gin 
585 



Ala Leu Ala Ala 
365 



Glu Leu Trp Ser 



His Asp Gin His 

400 



Thr Ala Pro Asn 
415 



Pro Lys Asn Phe 
430 



Leu Glu Leu Gin 
445 



Ala Gin Arg Asn 



Pro Asp Asn Thr 

480 



Tyr Phe Gin Asn 
495 



Leu Met Gly Gin 
510 



Cys Ala Asn Ala 
525 



Ser Cys Met Ala 



Gly Gly Tyr Lys 

560 



Thr Phe Ser Leu 
575 



Ala Lys Leu Trp 
590 



Glu Glu Ala Phe Leu Glu Glu Met Arg Ala Phe Gin Arg Arg Met Ala 
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595 600 605 

Gly Met Phe Gin Val Thr Phe Met Ala Glu Arg Ser Leu Glu Asp Glu 
610 615 620 

He Asn Arg Thr Thr Ala Glu Asp Leu Pro He Phe Ala Thr Ser Tyr 
625 630 635 640 

He Val He Phe Leu Tyr He Ser Leu Ala Leu Gly Ser Tyr Ser Ser 

645 650 655 

Trp Ser Arg Val Met Val Asp Ser Lys Ala Thr Leu Gly Leu Gly Gly 

660 665 670 

Val Ala Val Val Leu Gly Ala Val Met Ala Ala Met Gly Phe Phe Ser 
675 680 685 

Tyr Leu Gly He Arg Ser Ser Leu Val He Leu Gin Val Val Pro Phe 
690 695 700 

Leu Val Leu Ser Val Gly Ala Asp Asn He Phe He Phe Val Leu Glu 
705 710 715 720 

Tyr Gin Arg Leu Pro Arg Arg Pro Gly Glu Pro Arg Glu Val His He 

725 730 735 

Gly Arg Ala Leu Gly Arg Val Ala Pro Ser Met Leu Leu Cys Ser Leu 

740 745 750 

Ser Glu Ala He Cys Phe Phe Leu Gly Ala Leu Thr Pro Met Pro Ala 
755 760 765 

Val Arg Thr Phe Ala Leu Thr Ser Gly Leu Ala Val He Leu Asp Phe 
770 775 780 

Leu Leu Gin Met Ser Ala Phe Val Ala Leu Leu Ser Leu Asp Ser Lys 
785 790 795 800 

Arg Gin Glu Ala Ser Arg Leu Asp Val Cys Cys Cys Val Lys Pro Gin 

805 810 815 

Glu Leu Pro Pro Pro Gly Gin Gly Glu Gly Leu Leu Leu Gly Phe Phe 

820 825 830 



Gin Lys Ala Tyr Ala Pro Phe Leu Leu His Trp He Thr Arg Gly Val 
835 840 845 
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Val Leu Leu Leu Phe Leu Ala Leu Phe Gly Val Ser Leu Tyr Ser Met 
850 855 860 



Cys His He Ser Val Gly Leu Asp Gin Glu Leu Ala Leu Pro Lys Asp 
865 870 875 880 



Ser Tyr Leu Leu Asp Tyr Phe Leu Phe Leu Asn Arg Tyr Phe Glu Val 

885 890 895 



Gly Ala Pro Val Tyr Phe Val Thr Thr Leu Gly Tyr Asn Phe Ser Ser 

900 905 910 



Glu Ala Gly Met Asn Ala He Cys Ser Ser Ala Gly Cys Asn Asn Phe 
915 920 925 



Ser Phe Thr Gin Lys He Gin Tyr Ala Thr Glu Phe Pro Glu Gin Ser 
930 935 940 



Tyr Leu Ala He Pro Ala Ser Ser Trp Val Asp Asp Phe He Asp Trp 
945 950 955 960 



Leu Thr Pro Ser Ser Cys Cys Arg Leu Tyr He Ser Gly Pro Asn Lys 

965 970 975 



Asp Lys Phe Cys Pro Ser Thr Val Asn Ser Leu Asn Cys Leu Lys Asn 

980 985 990 



Cys Met Ser He Thr Met Gly Ser Val Arg Pro Ser Val Glu Gin Phe 
995 1000 1005 



His Lys Tyr Leu Pro Trp Phe Leu Asn Asp Arg Pro Asn He Lys 
1010 1015 1020 



Cys Pro Lys Gly Gly Leu Ala Ala Tyr Ser Thr Ser Val Asn Leu 
1025 1030 1035 



Thr Ser Asp Gly Gin Val Leu Asp Thr Val Ala He Leu Ser Pro 
1040 1045 1050 



Arg Leu Glu Tyr Ser Gly Thr He Ser Ala His Cys Asn Leu Tyr 
1055 1060 1065 



Leu Leu Asp Ser Ala Ser Arg Phe Met Ala Tyr His Lys Pro Leu 
1070 1075 1080 
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Lys Asn Ser Gin Asp Tyr Thr Glu Ala Leu Arg Ala Ala Arg Glu 
1085 1090 1095 



Leu Ala Ala Asn He Thr Ala Asp Leu Arg Lys Val Pro Gly Thr 
1100 1105 1110 



Asp Pro Ala Phe Glu Val Phe Pro Tyr Thr He Thr Asn Val Phe 
1115 1120 1125 



Tyr Glu Gin Tyr Leu Thr He Leu Pro Glu Gly Leu Phe Met Leu 
1130 1135 1140 



Ser Leu Cys Leu Val Pro Thr Phe Ala Val Ser Cys Leu Leu Leu 
1145 1150 1155 



Gly Leu Asp Leu Arg Ser Gly Leu Leu Asn Leu Leu Ser He Val 
1160 1165 1170 



Met He Leu Val Asp Thr Val Gly Phe Met Ala Leu Trp Gly He 
1175 1180 1185 



Ser Tyr Asn Ala Val Ser Leu He Asn Leu Val Ser Ala Val Gly 
1190 1195 1200 



Met Ser Val Glu Phe Val Ser His He Thr Arg Ser Phe Ala He 
1205 1210 1215 



Ser Thr Lys Pro Thr Trp Leu Glu Arg Ala Lys Glu Ala Thr He 
1220 1225 1230 



Ser Met Gly Ser Ala Val Phe Ala Gly Val Ala Met Thr Asn Leu 
1235 1240 1245 



Pro Gly He Leu Val Leu Gly Leu Ala Lys Ala Gin Leu He Gin 
1250 1255 1260 



He Phe Phe Phe Arg Leu Asn Leu Leu He Thr Leu Leu Gly Leu 
1265 1270 1275 



Leu His Gly Leu Val Phe Leu Pro Val He Leu Ser Tyr Val Gly 
1280 1285 1290 



Pro Asp Val Asn Pro Ala Leu Ala Leu Glu Gin Lys Arg Ala Glu 
1295 1300 1305 
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Glu Ala Val Ala Ala Val Met Val Ala Ser Cys Pro Asn His Pro 
1310 1315 1320 

Ser Arg Val Ser Thr Ala Asp Asn He Tyr Val Asn His Ser Phe 
1325 1330 1335 

Glu Gly Ser He Lys Gly Ala Gly Ala He Ser Asn Phe Leu Pro 
1340 1345 1350 

Asn Asn Gly Arg Gin Phe 
1355 

<210> 45 

<211> 4471 

<212> DNA 

<213> Mus musculus 



<400> 45 



ggatcacttc 


ctggctctgg 


gatggcagct 


gcctggcagg 


gatggctgct 


ctgggccctg 


60 


ctcctgaatt 


cggcccaggg 


tgagctctac 


acacccactc 


acaaagctgg 


cttctgcacc 


Iz 0 


ttttatgaag 


agtgtgggaa 


gaacccagag 


ctttctggag 


gcctcacatc 


actatccaat 


180 


atctcctgct 


tgtctaatac 


cccagccccg 


ccatgtcaca 


ggtgaccacc 


tggctcttct 


240 


ccagcgcgtc 


tgtccccgcc 


tatacaatgg 


ccccaatgac 


acctatgcct 


gttgctctac 


300 


caagcagctg 


gtgtcattag 


acagtagcct 


gtctatcacc 


aaggccctcc 


ttacacgctg 


360 


cccggcatgc 


tctgaaaatt 


ttgtgagcat 


acactgtcat 


aatacctgca 


gccctgacca 


420 


gagcctcttc 


atcaatgtta 


ctcgcgtggt 


tcagcgggac 


cctggacagc 


ttcctgctgt 


480 


ggtggcctat 


gaggcctttt 


atcaacgcag 


ttttgcagag 


aaggcctatg 


agtcctgtag 


540 


ccgggtgcgc 


atccctgcag 


ctgcctcgct 


ggctgtgggc 


agcatgtgtg 


gagtgtatgg 


600 


ctctgccctc 


tgcaatgctc 


agcgcctggc 


tcaacttcca 




gggaatggcc 


660 


tggctccgct 


ggacatcacc 


ttccacctct 


tggagcctgg 


ccaggccctg 


gcagatggga 


720 


tgaagccact 


ggatgggaag 


atcaaaccct 


gcaatgagtc 


ccagggtgaa 


gactcggcag 


780 


cctgttcctg 


ccaggactgt 


gcagcatcct 


gccctgtcat 


ccctccgccc 


ccggccctgc 


840 


gcccttcttt 


ctacatgggt 


cgaatgccag 


gctggctggc 


tctcatcatc 


atcttcactg 


900 


ctgtctttgt 


attgctctct 


gttgtccttg 


tgtatctccg 


agtggcttcc 


aacaggaaca 


960 


agaacaagac 


agcaggctcc 


caggaagccc 


ccaacctccc 


tcgtaagcgc 


agattctcac 


1020 


ctcacactgt 


ccttggccgg 


ttcttcgaga 


gctggggaac 


aatggtggcc 


tcatggccac 


1080 
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tcactgtctt 


ggcactgtcc 


ttcatagttg 


tgatagcctt 


gtcagtaggc 


ctgaccttta 


1140 


tagaactcac 


cacagaccct 


gtggaactgt 


ggtcggcccc 


taaaagccaa 


gcccggaaag 


1200 


aaaaggcttt 


ccatgacgag 


cattttggcc 


ccttcttccg 


aaccaaccag 


atttttgtga 


1260 


cagctaagaa 


caggtccagc 


tacaagtacg 


actccctgct 


gctagggccc 


aagaacttca 


1320 


gtgggatcct 


atccctggac 


ttgctgcagg 


agctgttgga 


gctacaggag 


agacttcgac 


1380 


acctgcaagt 


gtggtcccat 


gaggcacagc 


gcaacatctc 


cctccaggac 


atctgctatg 


1440 


ctcccctcaa 


accgcataac 


accagcctca 


ctgactgctg 


tgtcaacagc 


ctccttcaat 


1500 


acttccagaa 


caaccacaca 


ctcctgctgc 


tcacagccaa 


ccagactctg 


aatggccaga 


1560 


cctccctggt 


ggactggaag 


gaccatttcc 


tctactgtgc 


caatgcccct 


ctcacgtaca 


1620 


aagatggcac 


agccctggcc 


ctgagctgca 


tagctgacta 


cggggcgcct 


gtcttcccct 


1680 


tccttgctgt 


tgggggctac 


caagggacgg 


actactcgga 


ggcagaagcc 


ctgatcataa 


1740 


ccttctctat 


caataactac 


cccgctgatg 


atccccgcat 


ggcccacgcc 


aagctctggg 


1800 


aggaggcttt 


cttgaaggaa 


atgcaatcct 


tccagagaag 


cacagctgac 


aagttccaga 


1860 


ttgcgttctc 


agctgagcgt 


tctctggagg 


acgagatcaa 


tcgcactacc 


atccaggacc 


1920 


tgcctgtctt 


tgccatcagc 


taccttatcg 


tcttcctgta 


catctccctg 


gccctgggca 


1980 


gctactccag 


atggagccga 


gttgcggtgg 


attccaaggc 


tactctgggc 


ctaggtgggg 


2040 


tggctgttgt 


gctgggagca 


gtcgtggctg 


ccatgggctt 


ctactcctac 


ctgggtgtcc 


2100 


cctcctctct 


ggtcatcatt 


caagtggtac 


ctttcctggt 


gctggctgtg 


ggagctgaca 


2160 


acatcttcat 


ctttgttctt 


gagtaccaga 


ggctgcctag 


gatgcccggg 


gagcagcgag 


2220 


aggctcacat 


tggccgcacc 


ctgggtagtg 


tggcccccag 


catgctgctg 


tgcagcctct 


2280 


ctgaggccat 


ctgcttcttt 


ctaggggccc 


tgacctccat 


gccagctgtg 


aggacctttg 


2340 


ccttgacctc 


tggcttagca 


atcatctttg 


acttcctgct 


ccagatgaca 


gcctttgtgg 


2400 


ccctgctctc 


cctggatagc 


aagaggcagg 


aggcctctcg 


ccccgacgtc 


gtgtgctgct 


2460 


tttcaagccg 


aaatctgccc 


ccaccgaaac 


aaaaagaagg 


cctcttactt 


tgcttcttcc 


2520 


gcaagatata 


cactcccttc 


ctgctgcaca 


gattcatccg 


ccctgttgtg 


ctgctgctct 


2580 


ttctggtcct 


gtttggagca 


aacctctact 


taatgtgcaa 


catcagcgtg 


gggctggacc 


2640 


aggatctggc 


tctgcccaag 


gattcctacc 


tgatagacta 


cttcctcttt 


ctgaaccggt 


2700 


acttggaagt 


ggggcctcca 


gtgtactttg 


acaccacctc 


aggctacaac 


ttttccaccg 


2760 


aggcaggcat 


gaacgccatt 


tgctctagtg 


caggctgtga 


gagcttctcc 


ctaacccaga 


2820 


aaatccagta 


tgccagtgaa 


ttccctaatc 


agtcttatgt 


ggctattgct 


gcatcctcct 


2880 
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gggtagatga 


cttcatcgac 


tggctgaccc 


catcctcctc 


gtggccccca 


taaagatgag 


ttctgtccct 


caacggatac 


actgcatgaa 


ccgcactctg 


ggtcccgtga 


gacccacaac 


tgccctggtt 


cctgaatgat 


acgcccaaca 


tcagatgtct 


atagaacctc 


tgtgaatt tg 


atctcagatg 


gccagattat 


accacaagcc 


cttacggaac 


tcacaggact 


ttacagaagc 


tagcagccaa 


catcacagct 


gaactacgga 


aggtgcctgg 


tcttccctta 


cacgatctcc 


aatgtgttct 


accagcaata 


gaatcttcac 


tcttgctctc 


tgcttcgtgc 


ccacctttgt 


gcctggacat 


acgctcaggc 


atcctcaacc 


tgctctccat 


ccatcggcct 


catggctgtg 


tggggtatca 


gctacaatgc 


tcacggcagt 


gggcatgtct 


gtggagttcg 


tgtcccacat 


gcaccaagcc 


tacccggctg 


gagagagcca 


aagatgctac 


tgtttgctgg 


agtggccatg 


accaacttcc 


cgggcatcct 


cccagcttat 


ccagatt ttc 


ttcttccgcc 


tcaacctcct 


tacacggcct 


ggtcttcctg 


cccgttgtcc 


tcagctatct 


ctctggtact 


ggaggagaaa 


ctagccactg 


aggcagccat 


cacagtaccc 


cttcccggct 


gatgcaaaca 


ccagtgacct 


ccagaattta 


tccctgaaat 


taatgctgct 


agcagctctc 


ttctaatgga 


gtaggagctt 


gtccaggctc 


catggttctt 


gtcttccctc 


tggttgtttc 


caaggcctgg 


ggaaagttgt 


attcttgtcc 


tgaggcagcc 


agcactggcc 


actttgttgt 


atcagattac 


ctcctctgta 


aagagaatat 


cttgagtatt 


caattaaaaa 


ggccatggcc 


tatggcttag 


gcaggaaata 


aagaaaggat 


tctgggataa 


aggacacttg 


ggaacgtgtg 


gtaattagcc 


atgtggcgaa 


atgtagatta 


atataaatgc 


gtctagctat 


atggccaagg 


tatttataaa 


t 
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ctgctgccgc atttataccc 2940 

ttccttcaac tgtctcaaaa 3 000 

agaacagttt cataagtacc 3 060 

taaagggggc ctagcagcgt 3120 

agcctcccag ttcatggcct 3180 

tctccgggca tcccggttgc 3240 

gacagatccc aactttgagg 3300 

cctgacggtt ctccctgagg 3360 

ggtctgctac ctcctactgg 3420 

cattatgatc ctcgtggaca 3480 

tgt.gtccctc atcaaccttg 3540 

tacccggtcc tttgctgtaa 3 600 

tatcttcatg ggcagtgcgg 3660 

catcctgggc tttgctcagg 3720 

gatcaccttg ctgggtctgc 37 80 

ggggccagat gttaaccaag 3840 

ggtctcagag ccttcttgcc 3900 

atgttaacta aggctttaat 3960 

tgcccaaaag tgaccaaaag 4020 

gctgataagg ggccacgagg 4080 

tccagaaaaa aattgctggc 4140 

cataggtccc cgaggccatg 4200 

gtatgggatg tatcacatgt 4260 

gggtgtggaa catccaggag 4320 

gcagtggtac ctgagcacag 4380 

atatctaagt tatgattcta 4440 

4471 



<210> 46 
<211> 25 
<212> DNA 
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<213> 



Artificial sequence 



<220> 



<223> 



primer 



<400> 46 

atgttaggtg agtctgaacc taccc 



25 



<210> 47 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 47 

ggattgcatt tccttcaaga aagcc 25 

<210> 48 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer 

<400> 48 

tatggctctg ccctctgcaa tgctc 25 

<210> 49 

<211> 28 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer 
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<400> 49 

tcagcagcct ctgttccaca tacacttc 28 

<210> 50 

<211> 25 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> primer 

<400> 50 

gttccacagg gtctgtggtg agttc 25 



